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Chapter 1 
Introduction 



In this thesis, we consider codes of length n over an alphabet of q symbols. We give new 
upper bounds on the maximum size Aq{n, d) of codes with minimum distance d, and new 
lower bounds on the minimum size Kq{n,r) of codes of covering radius r. The bounds 
are based on semidefinite programming and on an explicit block diagonalisation of the 
(non-commutative) Terwilliger algebra of the Hamming scheme. Our methods can be seen 
as a refinement of Delsarte's linear programming approach and are related to the theory 
of matrix cuts. They build upon the recent work of Schrijver [38] for binary codes. 

1.1 Codes 

A code is a collection of words of some fixed length, for example the collection of all 
six letter words in a dictionary. However, the words need not have any meaning. They 
are merely concatenations of symbols chosen from a fixed set called the alphabet. Other 
examples of codes are: all possible strings of eight binary digits, a set of bets in a football 
pool, or a collection of DNA sequences. Here the alphabets are {0, 1}, {Win, Lose, Tie} 
and {A, C, T, G} respectively. 

It is often important to know how similar two words are. This can be measured by 
their Hamming distance. By definition, this is the number of positions in which the two 
words differ. Suppose for example that we want to transmit information over a noisy 
communication channel. The letters in a transmitted word each have some small chance 
of being changed into a different letter. At the receiving end, we would like to be able to 
recover the original message (if not too many letters are erroneous). This can be achieved 
by using a code in which any two words have distance at least d = 2e + l for some integer 
e. If we only transmit words belonging to this code, it is always possible to recover the 
sent code word if at most e errors are introduced during transmission. The received 
word is interpreted as the code word that is the closest match. If we aim for a highest 
possible information rate, we should maximize the number of words in the code, under 
the condition that any two code words have distance at least d. Geometrically, this means 
that we want to pack a maximum number of spheres ('balls' would be more accurate) of 
radius r inside the Hamming space consisting of all g" words of length n. Here the chosen 
code words correspond to the centers of the spheres. This leads to the following central 
question in coding theory. 

1 
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What is the maximum cardinality of a code of word length n, in which 
any two words have distance at least d ? 

When the alphabet consists of q symbols, this maximum is denoted by Aq{n^d). The 
number Aq{n,d) can also be seen as the stability number of a graph. Let G be the graph 
with the set of all g" words as vertices, and two words are joined by an edge if their 
distance is less than d. Then the maximum size of a set of vertices, no two of which are 
joined by an edge, equals Aq{n, d). 

The problem of determining Aq{n, d) is hard in general and we will have to be satisfied 
with lower and upper bounds. One major field of research is to find explicit examples 
of (families of) good codes. In this thesis we will address the converse problem and 
give upper bounds on the numbers Aq{n,d). In the case d = 2e + 1 the geometric idea 
of packing spheres already gives an upper bound. Since the spheres are disjoint, their 
'volumes' should add up to a number that is at most g". This gives an upper bound on 
Aq{n, d) called the sphere packing bound. 

1.2 The Delsarte bound 

Currently, many of the best bounds known, are based on Delsarte's linear programming 
approach [T3]. When viewed from the right perspective, the work in this thesis can be 
seen as a refinement of this method. Let us give a very rough sketch of Delsarte's method. 
As is often the case in mathematics, we first seem to make the problem harder. Instead 
of optimizing the cardinality of a code directly, we associate to each code C a symmetric 
matrix of which the rows and columns correspond to all g" possible code words. The 
matrix is constructed by putting a 1 in those positions where both the row and the 
column of the matrix belong to C, and a in all other positions. The size of the code 
can be recovered from the matrix by dividing the total number of ones by the number of 
ones on the diagonal. Although we have no good grasp of the set of matrices that arise 
this way, they share some important and elegant abstract properties: 

• the matrix has zeros in positions indexed by a row and column that are at distance 

l,2,...,c?-l, 

• the matrix is positive semidefinite: it has no negative eigenvalues. 

We enlarge our set of matrices from those associated to codes, to include all symmetric 
matrices sharing the two given properties. The resulting relaxation is much 'smoother' 
and has a very clear description which allows more efficient optimization. Of course 
the magical part is, that optimizing over this larger set gives a good approximation of 
the original problem! This bound was given in the more general setting of bounding 
the stability number of a graph by Lovasz [31]. It can be calculated using semidefinite 
programming in time bounded by a polynomial in the number of vertices of the graph. 

In the coding setting, this will not suffice since the size of the matrices is prohibitively 
large. Even for codes of length n = 20, we have to deal with matrices of more then a 
million rows and columns. However, the problem admits a very large symmetry group. 
It turns out that we can use these symmetries to our advantage to — dramatically — 
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reduce the complexity of the problem. We may restrict ourselves to only those matrices, 
that are invariant under the full group of symmetries. These matrices live in a low- 
dimensional commutative subalgebra called the Bose-Mesner algebra of the Hamming 
scheme. Diagonalising this algebra reduces the huge optimization problem to a simple 
linear program of only n variables! The resulting linear programming bound (adding the 
constraint that the matrix is nonnegative) is due to Delsarte. 

1.3 Overview of the thesis 

In this thesis, we give tighter bounds for codes by essentially isolating more properties 
satisfied by the zero-one matrices associated to a code. More accurately, we associate to 
each code C two matrices. Both matrices are obtained by summing zero-one matrices 
corresponding to certain permutations of the code C. This allows to include constraints 
that come from triples of code words, instead of pairs. This method was initiated recently 
by Schrijver [38] to obtain bounds for binary error correcting codes, resulting in a large 
number of improved upper bounds. 

The main result in this thesis is to generalize the methods to include non-binary 
codes. The primary issue that we need to deal with, is how to exploit the remaining 
symmetries to obtain a semidefinite program of a size that is polynomially bounded by 
the word length n. This is the most technical part of the thesis and requires an explicit 
block diagonalisation of the Terwilliger algebra of the nonbinary Hamming scheme. Such 
a block diagonalisation is described in Chapter [3j It uses the block diagonalisation of 
the Terwilliger algebra of the binary Hamming scheme found by Schrijver, which we will 
describe as well. 

In Chapter |4] we apply our methods to obtain a semidefinite programming bound for 
nonbinary codes. Computationally, we have found a large number of improved upper 
bounds for q = 3,4, 5, which we have tabulated in the final section. 

In Chapter [5] we discuss covering codes. The problem here is to cover the Hamming 
space with as few spheres as possible. When the spheres have radius r, this minimum 
number of required spheres is denoted by Kq{n,r). We give new linear and semidefinite 
programming bounds on Kq{n, r). For g = 4, 5 we obtain several improved lower bounds 
on Kq{n,r). 

In Chapter |6] we relate our coding bounds to the general theory of matrix cuts for 
obtaining improved relaxations of 0-1 polytopes. It is shown that the bound for error 
correcting codes is stronger than the bound obtained from a single iteration of the A^+ 
operator applied to the modified theta body of the graph on all words in which two words 
are joined by an edge if they are at distance smaller then d. 
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Chapter 2 



Preliminaries 



This thesis is largely self-contained and most results are derived from explicit construc- 
tions. However, some theory is desirable for putting them into the right perspective and 
relating them to the body of mathematics to which they connect. In this chapter we 
give some definitions and basic facts. After giving some general notation in the first 
section, we introduce matrix *-algebras in the second section, which are an important 
tool throughout the thesis. The main (classical) theorem says (roughly) that any matrix 
*-algebra is isomorphic to a direct sum of full matrix *-algebras. In the third section we 
describe semidefinite programming. The bounds we derive for codes, are defined as the 
optimum of certain semidefinite programs and can be computed efficiently. Finally, we 
recall the basics of association schemes. In particular we describe the Delsarte bound on 
the maximum size of cliques in association schemes. 



2.1 Notation 

For positive integers n, m and a set R (usually i? = C, M), we denote by _R"^™ the set of n 
by m matrices with entries in R and by i?" the set of (column) vectors of length n. When 
i? is a ring, we define matrix addition and multiplication of matrices (with compatible 
dimensions) as usual. Frequently, the rows and columns correspond to the elements of 
some given finite sets X and Y. When we want to explicitly index the rows and columns of 
the matrix using these sets, we will write R^^^ for the set of matrices with rows indexed 
by X and columns indexed by Y . The i-ih. row of a matrix A is denoted by Ai and the 
entry in row i and column j by Ajj. The transpose of an X x F matrix is the F x X 
matrix A""", where Aj, = Aj^i for i E Y ^ j E X. When |y| = 1, we often identify the 
matrices in R^^^ and the vectors in R^ . 

For finite sets X,Y, the all-one vector in R^ is denoted by 1. The X xY all-one 
matrix is denoted by J, the all-zero matrix by and the X x X identity matrix by I. 
The sets X and Y will be clear from the context. 

Given a matrix A G R^^^ , we define diag(A) to be the vector a G R^ of diagonal 
elements of A, that is a^ := Ai^i for i G X. The trace of A is the sum of the diagonal 
elements of A and is denoted tiA. So tiA = l"''diag(A). We mention the useful fact that 
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for matrices A G i?^^' and B G R''^^ the following identity holds: 

tT{AB) = tT{BA). (2.1) 

Given a vector a G R^ , we denote by Diag(a) the diagonal matrix A G R^^^ with 
diag(A) = a. 

For a subset S" C X we denote by x^ the vector in R^ definied by 

ix%:=l' '''^^ (2.2) 

I otherwise. 

For a vector x and a set S, we define 

:(5):=5^x,. (2.3) 



x[ , 



For a matrix A G C , the conjugate transpose of A is denoted by A*. That is 
A* J = Aj^i for i G X and j G F, where z is the complex conjugate of a complex number 
z. A square matrix A is called normal if ^4*^4 = AA*, hermitian ii A* = A and unitary if 
AM = AA* = I. 

For A, 5 G C^^^, we define 

{A,B):=tTiAB*)= Y, AjA7- (2-4) 

iexjeY 

This is the standard complex inner product on C^^^. Observe that 

(A,J) = 1^A1. (2.5) 

For matrices A G C"^^^^^ and B G C'''"^^^^, we denote by A (g) i? the tensor product of 
A and i? defined as the {Xi x X2) x [Yi x 1^2) matrix given by 

(A (g) i?)(j^j/)^(jj/) := Ajj_Bj/j/. (2.6) 

2.2 Matrix *-algebras 

In this section we consider algebras of matrices. For general background on linear algebra 
we refer the reader to [221 121! • 

A matrix *- algebra is a nonempty set of matrices A C C"^" that is closed under 
addition, scalar multiplication, matrix multiplication and under taking the conjugate 
transpose. A matrix *-algebra is a special case of a finite dimensional C*-algebra. Trivial 
examples are the full matrix algebra C"^" and the zero algebra {0}. 

Most of the matrix *-algebras that we will encounter in this thesis are of a special type. 
They are the set of matrices that commute with a given set of permutation matrices. More 
precisely, we have the following. 
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Let G C S'„ be a subgroup of the symmetric group on n elements. To every element 
a E G we associate the permutation matrix M^ G C"^" given by 

iM.)..,:-ll '':<^' = '' (2.7) 

10 otherwise. 

Observe that 

m: = m; = m.-i. (2.8) 

The map a h-)- M^- defines a representation of G. This means that for all a,T E G we have 

Mra = MrM„ and M^-i = M'^. (2.9) 

Here ra denotes the permutation (rcr)(z) := r(cr(z)). We define the centralizer algebra 
(see [2]) of G to be the set A of matrices that are invariant under permuting the rows and 
columns by elements of G. That is, 

A:={Ae C"^" I M-^AM„ = A for all a e G}. (2.10) 

If we denote by B the matrix *-algebra spanned by the set of permutation matrices 
{Mo-, ex G G}, then A is also called the commutant algebra of B: the algebra of matrices 
that commute with all the elements of B. To see that the set A is indeed a matrix *- 
algebra, we first observe that it is closed under addition and scalar multiplication. That 
A is closed under matrix multiplication and taking the conjugate transpose follows from 

M-^ABM^ = M-^AMM~^BM„ = AB, (2.11) 

M;^A*M^ = {M-^AM^y = A* 

for any A,B E A. 

One of the special features of A is that it contains the identity and is spanned by 
a set of zero-one matrices whose supports partition {1, . . . ,n} x {1, . . . ,n} (that is, it 
is the algebra belonging to a coherent configuration, see [12] )• These matrices have a 



combinatorial interpretation. Indeed, from (2.10) it follows that 



A E A ii and only if Aij = y4o-(i),cr(j) for all i,j G X. (2-12) 

Hence A is spanned by zero-one matrices Ai, . . . ,At, where the supports of the Ai are the 
orbits oi {1, . . . ,n} x {1, . . . , n} under the action of G, called the orbitals of G. 

The following structure theorem is one of the main motivations for this thesis. It allows 
to give a matrix *-algebra a simple appearance by performing a unitary transformation 
(a block diagonalisation). We will not use this theorem, but rather give explicit block 
diagonalisations for the matrix ^-algebras under consideration. 

Theorem 1. Let A C C"^" ^e a matrix *-algebra containing the identity matrix I. Then 
there exists a unitary nxn matrix U and positive integers pi, . . . ,pm and gi, . . . , g^ such 
that U*AU consists of all block diagonal matrices 

/Gi • • • \ 

G2 ■■■ 

. . (2.13) 

: : ■. 

■•• G„ 
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where each Ck is a block diagonal matrix 



(2.14) 



fBk • • • \ 
Bk ■■■ 

: : ■■. 

\0 ■■■ BkJ 

with qk identical blocks Bk G C^'^^P'' on the diagonal. 

Observe that the numbers pi, . . . ,pm and qi, . . . ,qm satisfy 

qiPi + q2P2^ \-qmPm = n, (2.15) 

Pi+pI + '-'+pIi = dim A 

We call the algebra U*AU a block diagonalisation of A. This theorem was proved in [3] 
by using (a special case of) the Wedderburn-Artin theorem (see also [15], [35]). However, 
we will present a self-contained proof here. 

A well-known instance is when A C C"^" is commutative. This occurs for example 
when A is the Bose-Mesner algebra of an association scheme. In the commutative case we 
must have pi = . . . = Pm = 1, since for any p > 2 the algebra C^^p is non-commutative. 
The theorem then says that the matrices in A can be simultaneously diagonalised: 

U*AU = {Xih +X2h + ---+ Xmlm \ X G C"}, (2.16) 

where for each k the matrix Ik G C" is a zero-one diagonal matrix with qk ones, and 
Ii + ■ ■ ■ + Im = I ■ The rest of this section is devoted to proving Theorem [l] 

We first introduce some more notation. For two square matrices A G C"^"" and 
B G C"'^™, we define their direct sum A®B e C("+™)><("+™) by 

A^B:=(^ll). (2.17) 

For two matrix *-algebras A and i3, we define their direct sum by 

A®B:={A®B\AeA,B eB}. (2.18) 

This is again a matrix *-algebra. For a positive integer t, we define 

tQA:={tQA\AeA}, (2.19) 

where tQ A denotes the iterated direct sum ®\^^A. 

We call two square matrices A.,B & (j^nxn ^q^niyalent if there exists a unitary matrix U 
such that B = U*AU. We extend this to matrix *-algebras and call two matrix *-algebras 
A and B equivalent if i3 = U*AU for some unitary matrix U . 

Theorem [l] can thus be expressed by saying that every matrix *-algebra A containing 
the identity matrix is equivalent to a matrix *-algebra of the form 



0(g,0C^^^PO- (2.20) 



j=i 
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We start by considering the commutative case. We first introduce some more notions. 
Let A C C"^" and let V C C" be a linear subspace. We say that V is A-invariant when 
Av G V for every A E A and every v E V. Observe that if A is closed under taking the 
conjugate transpose, also the orthoplement 

V^ := {veC^l {v, u) = for all u e V} (2.21) 

is ^-invariant. Indeed, for every u & V, v & V-^ and AEAwe have 

{Av,u) = {v,A*u) = 0, (2.22) 

since A*u G V. 

A nonzero vector f G C" is called a common eigenvector for A when Cv is ^-invariant. 
We recall the following basic fact from linear algebra. 

Fact. Let V be a complex linear space of finite, nonzero dimension, and let A : V — > V 
be a linear map. Then there exist A G C and v E V \ {0} such that Av = Xv. 

In particular, this implies that when A G C"^" and V C C" is {A}-invariant, there 
exists an eigenvector of A that belongs to V. We are now ready to prove the following 
theorem. 

Theorem 2. Let A C C"^" be a commutative matrix *-algebra and let V C C"' be an 
A-invariant subspace. Then V has an orthonormal basis of common eigenvectors for A. 

Proof. The proof is by induction on dim V". If all vectors in V are common eigenvectors 
for A, then we are done since we can take any orthonormal basis of V. Therefore we may 
assume that there exists an A E A such that not every f G V^ is an eigenvector for A. 
Since V is {A}-invariant, A has some eigenvector f G V^ of eigenvalue A G C Denote by 

Ex := {x G C" I Ax = Ax} (2.23) 

the eigenspace of A for eigenvalue A. As A is commutative, the space Ex is ^-invariant. 
This follows since for any B E A and any v E Ex 'we have 

A{Bv) = B{Av) = XBv, (2.24) 

and hence Bv E Ex- It follows that also V := V (lEx and V" := V (1 E^ are ^4- invariant. 
By assumption on A, V" has positive dimension, yielding a nontrivial orthogonal de- 
composition V = V (B V". By induction both V and V" have an orthonormal basis of 
common eigenvectors of A. The union of these two bases gives an orthonormal basis of 
V consisting of common eigenvectors of A. D 

Let us consider the special case V := C". Let {Ui, . . . , f/„} be an orthonormal basis 
of common eigenvectors for A and denote by U the square matrix with these vectors as 
columns (in some order). Then f/ is a unitary matrix that diagonalises A. That is, all 
matrices in U*AU are diagonal matrices. 
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Proposition 1. Let A C (C">^" ^e an algebra consisting of diagonal matrices. Then there 
exist zero-one diagonal matrices Ii, . . . Im with disjoint support such that 

A = CIi + --- + CIm. (2.25) 

Proof. Let S := {i & {1, . . . ,n} \ Ai^i ^ for some A E A} be the union of the supports 
on the diagonal, of the matrices in A. Define an equivalence relation on S by calling i 
and j equivalent when Ai^i = Ajj for every A E A, and let 5*1, . . . , 5*^ be the equivalence 
classes. Denote for k = 1, . . . ,m hy Ik '■= Diag(x'^'°) the zero-one diagonal matrix with 
support Sk- The inclusion 

ACCh + --- + CIm (2.26) 

is clear. 

To finish the proof, we show that Ii, . . . ,Im ^ A. It is not hard to see that there is a 
matrix A E A with 

A = Cji + C2/2 + ■ • • + Cmlm, (2.27) 

for pairwise different nonzero numbers Ci, . . . , c^. It then follows that for k = 1, . . . ,m 
we have 

h = -^^, -i- (2.28) 

Since the right-hand side is a polynomial in A with constant term equal to zero, we obtain 
he A. D 

When ^ is a commutative matrix *-algebra containing the identity, and U is a unitary 
matrix diagonalising the algebra, say U*AU = C/i + ■ ■ ■ + dm, then the matrices 

Ek := UhU* E A (2.29) 

form a basis of orthogonal idempotents of A. They satisfy 

Ei + --- + Em = /, (2.30) 

E^ = El 
for i^i G {1, . . . ,m}. Geometrically, we have an orthogonal decomposition 

C" = 1^1 © • • ■ ® Kn (2.31) 

and Ek is the orthogonal projection of C" onto Vk- 

We will now consider the case that the matrix *-algebra A is not necessarily commu- 
tative. We first introduce some terminology. Let A C C"^" be a matrix *-algebra. An 
element E E Ais called a unit of A when EA = AE = A for every A E A. Every matrix 
*-algebra has a unit, see Proposition |3] below. A sub *-algebra of ^ is a subset of A that 
is a matrix *-algebra. An important example is 

Ca--={AeA\AB = BA for all B E A}. (2.32) 
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An ideal of ^ is a sub *-algebra that is closed under both left and right multiplication 
by elements of A. We observe that if X is an ideal of A and E is the unit of X, then 
E G C_4. This follows since for any A E A both EA and AE belong to X and hence 
EA = EAE = AE. 

We say that a commutative sub *-algebra of A is maximal if it is not strictly contained 
in a commutative sub *-algebra of A. We have the following useful property. 

Proposition 2. Let B be a maximal commutative sub *-algebra of the matrix *-algebra 
A and let 

B' ■.= {AeA\AB = BA for all B e B}. (2.33) 

Then B' = B. 

Proof. Clearly B (^ B'. We show the converse inclusion. First observe that for any A E B' 
also A* E B' . This follows since for any B E B we have 

A*B = {B*Ay = (AB*)* = BA*. (2.34) 

Next, we show that B contains every A E B' that is normal (that is A A* = AA*). This 
follows since for any normal A E B' \ B the commutative matrix *-algebra generated by 

A, A* and B, strictly contains B. 

Finally, let A E B' he arbitrary. The matrix A + A* is normal, and hence belongs to 

B. It follows that A{A + A*) = {A + A*)A, or AA* = A* A, and hence A itself is normal 
and therefore belongs to B. D 

When a matrix *-algebra does not contain the identity, the following proposition is 
useful. 

Proposition 3. Every nonzero matrix *-algebra A is equivalent to a direct sum of a 
matrix *-algebra containing the identity and (possibly) a zero algebra. In particular, A 
has a unit. 

Proof. Let Bhe a, maximal commutative sub *-algebra of A. By diagonalising B we may 
assume that 

-B = C/i + ■ ■ ■ + CJ™ (2.35) 

for diagonal zero-one matrices Jq, h, . . . , Im with / = Jq + /i + ■ ■ • + Im- If -^o = 0, we are 
done. So we may assume that Iq ^ 0. To prove the proposition, it suffices to show that 

I^A = AIo = {0}, (2.36) 

since this implies that A is the direct sum of the algebra obtained by restricting A to the 
support of Ji + ■ ■ ■ + /„ and the zero algebra on the support of Jq. 
First observe that 

IoA = A-{Ii-\ h Im)A E A for every A E A. (2.37) 

Let A E Ahe arbitrary and let 

A' := iIoA)iIoA)* E A. (2.38) 
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Then for k = 1, . . . ,m we have 

hA' = hIoAA*Io = = IoAA*Ioh = A'h. (2.39) 

It follows that A' commutes with Ii, . . . ,Im and hence is a linear combination of Ji, . . . , Jm 
by the maximality of B. On the other hand A'l^ = for k = 1, . . . ,m, and hence A' = 0. 
It follows that also IqA = 0. 

Similarly, by considering A* we obtain I^A* = and hence AIq = 0. D 

We call a nonzero matrix *-algebra A simple if C^ = CE, where E is the unit of A. 
Since the unit of any ideal of A belongs to C^, it follows that if A is simple, it has only 
the two trivial ideals {0} and A. The reverse implication also holds (see Proposition 111) . 

Proposition 4. Every matrix *-algebra A containing the identity is equivalent to a direct 
sum of simple matrix *-algebras. 

Proof. Since C^ is commutative, we may assume it is diagonalised by replacing A by 
U*AU for a unitary matrix U diagonalising C^. Then 

C^ = C/i + ■ ■ ■ + C/„ (2.40) 

where /i, . . . , /^ are zero-one diagonal matrices with Ji + ■■■ + /,„ = /. For every i,j G 
{1, . . . , m} with i y^ j we have 

I.AIj = IJ,A = {0}. (2.41) 

It follows that A is the direct sum 

A = Ai®---®Am, (2.42) 

where for i = 1,. . . ,m the matrix *-algebra Ai is obtained from liAIi by restricting to 
the rows and columns in which /j has a 1. D 

Finally, we show that every simple matrix *-algebra can be brought into block diagonal 
form. 

Proposition 5. Every simple matrix *-algebra A containing the identity is equivalent to 
a matrix *-algebra of the form t C™^*" for some t,m. 

Proof. Let A C C^-x" be a simple matrix *-algebra containing the identity, and let B be 
a maximal commutative sub *-algebra of A. We may assume that B consists of diagonal 
matrices, say 

B = Ch + --- + CIm (2.43) 

where Jj = x^' for z = 1, . . . , m and 5*1 U ■ ■ ■ U 5*.^ is a partition of {1, ... , n}. For every i 
and every A E A the matrix JjA/j commutes with Ji, . . . , /^ and hence, by the maximality 
of B, the matrix liAIi is a linear combination of Ji, . . . , Im- It follows that 

liAIi = C/i for i = 1, . . . , m. (2.44) 
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For any i, the set X := Ah A is a nonzero ideal of A. Hence the unit of X belongs to 
C_A = CJ. It follows that / G X and hence 

liAIj ^ {0} for every ij = l,...,m. (2.45) 

For any A E C"^", and i,j G {1, . • • ,fn}, we denote by Aij G Cl'^^'^l'^-'l the matrix 
obtained from A by restricting the rows to Si and the columns to Sj (and renumbering 



the rows and columns). By (2.45) we can fix an A G ^ with Aij ^ for every z,j G 



{1, . . . , m}. In fact we can arrange that 

tr((A„-)*A.) = I'^.l- (2.46) 

Let i be arbitrary and let A := IiAIi. Then 

A'{A')* is a nonzero matrix in C/i, (2.47) 

{A')*A' is a nonzero matrix in C/j. 

This shows that Ji and Jj have the same rank t, namely the rank of A'. In other words: 



Si\ = \Si\ =t. Moreover by (2.46), the matrices An are unitary since 



(Ai,,)Mi,, = Ai,,(Ai,,)* = /. (2.48) 

Let U := Al^-^ © ■ • ■ ©AJ ,„ G C"^" be the unitary matrix with blocks A^ ^ on the diagonal. 
By replacing A by U*AU we may assume that Aii = I for i = 1, . . . ,m. 
This implies that for any i,j G {1, . . . , m} 

Bi^i = Ai,,5,- 1 = {hAIiBIi)i^i G CJ for any B E A, (2.49) 

and hence 

5,j = 5,,j(A*)j-i = {IiBIjA*h),^i G CJ for any 5 G A (2.50) 

Summarizing, we have 

A={Ae C"^*" I Ai^j G CJ for all i,j G {1, . . . ,m}}. (2.51) 

By reordering the rows and columns, we obtain the proposition. D 

Proposition |4] and [5] together imply Theorem [l] 

2.3 Semidefinite programming 

In this section we introduce semidefinite programming. For an overview of semidefinite 
programming and further references, we refer the reader to |¥T] . 

Recall that a complex matrix A is called hermitian if A* = A. It follows that all 
eigenvalues of A are real. An hermitian matrix A G C"^" is called positive semidefinite, 
in notation A^ 0, when it has only nonnegative eigenvalues. 
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Proposition 6. For an hermitian matrix A G C"^" the following are equivalent: 

(i) A ^ 0, (2.52) 

(ii) xMx > for all x G C", 
(ill) A = B*B for some B G C"^". 

In the case that A is real, we may restrict to real vectors x in (ii) and take B real in (Hi). 

It follows that for positive seniidefinite matrices A,B E C"^" the inner product is 
nonnegative: 

{A, B) = ti{C*CDD*) = ti{CDD*C*) = {CD, CD) > 0, (2.53) 

when A = C*C and B = D*D. Another useful observation is that when A is positive 
semidefinite, every principal submatrix is positive seniidefinite as well. In particular, the 
diagonal of A consists of nonnegative real numbers. Also 

if U is nonsingular, then A^ ii and only if U*AU y 0. (2.54) 

In the remainder of this section, all matrices will be real. A semidefinite program is a 
an optimization problem of the following form, where Ai, . . . ,An,B are given symmetric 
matrices in M"^" and c G M" is a given vector: 

minimize c x (2.55) 

subject to XiAi + ■ ■ ■ + XnAn — B y 0. 

When Ai, . . . , A^, B are diagonal matrices, the program reduces to a linear program. In 



particular, linear constraints Ax < b can be incorporated into the program (2.55) by 
setting 

'A 



and 

where at is the i-th column of A. Semidefinite programs can be approximated in polyno- 
mial time within any specified accuracy by the ellipsoid algorithm ([H]) or by practically 
efficient interior point methods ( |34] ) . 

For any symmetric matrix A G M"^", the matrix R{A) is defined by: 

RiA) := (1 «;) . (2.58) 

where a := diag(A) is the vector of diagonal elements of A. We will index the extra row 
and column oi R{A) by 0. 

The following propositions are helpful when dealing with semidefinite programs that 
involve matrices of the form R{A). 
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Proposition 7. Let A G M"^" be a symmetric matrix such that diag(A) = c-Al for some 
c eM.. Then the following are equivalent: 

(i) R(yA) is positive semidefinite, (2.59) 

(ii) A is positive semidefinite and l^Al > (trA)^. 

Proof First assume that (i) holds. Let R{A) = U'^U, where U G M("+i)x(«+i). Using 
UJUq = 1, we obtain 

n n n 

l-'Al=Y,UlU, = (5^[/,)^(^f/,)-f/o^f/o (2.60) 

n 

1=1 

Here the inequality follows using Cauchy-Schwarz, and in the last equality we use UjUo = 
Ai^i. Next assume that (ii) holds. We may assume that iiA > 0, since otherwise A = and 
hence R{A) is positive semidefinite. Let A = U'^U where U G M"^". Let a := diag(A). 
For any x G M" the following holds: 

x'^Ax > {l'^ Al)-\x'^ Alf (2.61) 

> ^^—x^Al 
- \\^A\ ) 

= c^j^c X a 
I'a 

= {x'^ay. 

Here the first inequality follows by applying Cauchy-Schwartz on the inner product of Ux 
and Ul, and the second inequality follows from the assumption l"'"Al > {tiA)^. It follows 
that for any vector (") with a; G M" and a G M, we have 

{a,x'^)R{A)('^\ = a"^ + 2aa'^ X + x'^ Ax 

> a^ + 2aa^ X + {a^ xY 
= (a + a'^xf > 0. 

n 

This implies the folowing useful equivalence of semidefinite programs. 

Proposition 8. Let C C M^^" he a cone, and assume that the following two maxima 
exist: 

Oi := maxjl^Al | trA = 1, A ^ 0, A G C}, (2.62) 

O2 := maxjtrA | R{A) ^ 0, A G C}. 

Further assume that the maximum in the first program is attained by a matrix A with 
diag(A) = c ■ Al for some c eM.. Then Oi = 02- 
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Proof. Let A be an optimal solution to the first program with diag(y4) = c • Al for some 
c G M, and define A' := {l'^Al)A. Then 

l^A'l = {l^Alf = {tiA')\ (2.63) 

Hence A' is feasible for the second program by Proposition [7J Since trA' = l^Al we 
obtain O2 > Oi. 

Let A be an optimal solution to the second program. If tiA = we have Oi > O2 and 
we are done. Hence we may assume that tiA > 0. Observe that (trA)^ = l"''Al, since 
otherwise we would have l"'^Al = A(trA)^ for some A > 1 by Proposition ul This would 
imply that XA is also feasible, contradicting the optimality of A. Define A' := :^A. Then 
A' is feasible for the first program and 

I'^A'l = -^I'^Al = tiA (2.64) 

This implies that Oi > O2. □ 

An important special case is when all feasible matrices have constant diagonal and 
constant row sum. this occurs for example in semidefinite programs where the feasible 
matrices belong to the Bose-Mesner algebra of an association scheme. Another case is 
when the cone C is closed under scaling rows and columns by nonnegative numbers. 

Proposition 9. Let C C M"^" he a cone of symmetric matrices, such that for any 
nonnegative x G M" and any A & C also Diag(x)ADiag(x) belongs to C . Then any 
optimal solution A to the program 

max{l"^Al I trA = 1, A ^ 0, A G C} (2.65) 

satisfies diag(y4) = c ■ Al for some c E"^. 

Proof. Let A be an optimal solution. If Aj j=0 for some i, we have Aj = and the claim 
follows by induction on n. Therefore we may assume that a^ := \J Ai^i > for i = 1, . . . ,n. 
The matrix A' := (Diag(a))~^y4(Diag(a))'^ is scaled to have only ones on the diagonal. 
Now for every nonnegative x G MJ^ with ||a;|| = 1, the matrix A{x) := Diag(x)74'Diag(x) 



is a feasible solution to (2.65) and has value x'^A'x. By the optimality of A, the vector 
a maximizes x'^A'x over all nonnegative vectors x with ||2;|| = 1. In fact, since a > 0, it 
maximizes x'^A'x over all x with ||x|| = 1. As M" has an orthonormal basis of eigenvectors 
for A', it follows that a is an eigenvector of A' belonging to the maximal eigenvalue A. 
This implies that 

Al = Diag(a)A'Diag(a)l = Diag(a)A'a (2.66) 



1)1 


= Diag(a)A'a 




= ADiag(a)a 




= \{al,...,al 


al,. 


■■,alV- 



This finishes the proof since 

dmg{A) = {al...,a'J\ (2.67) 

n 
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2.4 Association schemes 

In this section, we give some basic facts and notions related to association schemes, 
including Delsarte's linear programming approach for bounding the size of cliques in 
an association scheme. This is by no means a complete introduction to the theory of 
association schemes. For further reading, we recommend [HI HI [IS] on association schemes 
and [10] on the related topic of distance regular graphs. 

Roughly speaking, an association scheme is a very regular colouring of the edges of 
a complete graph. The colouring is such, that the number of walks from a vertex a to 
a vertex b traversing colours in a prescribed order, does not depend on the two vertices 
a and b, but merely on the colour of the edge ab. The following formal definition is due 
to Bose and Shimamoto [8]. A t-class association scheme S = {X, {Rq, Ri, . . . , Rt}) is a 
finite set X together with t + 1 relations Rq, ... ,Rt on X that satisfy the following axioms 

(i) {-Ro) Ri, ■ ■ ■ , Rt} is a partition oi X x X, 

(ii) Rq = {{x,x) I X G X}, 

(iii) (x, y) G Ri if and only if {y, x) G Ri for all x , y E X , i G {0, . . . ,t} , 
(iv) for any i,j, k E {0, . . . ,t} there is an integer pfj such that 

|{2; G X I {x,z) G Ri, {z,y) G Rj}\ = Pij whenever {x,y) G Rk- 

The set X is called the set of points of the association scheme and two points x,y E X 
are said to be i-related when {x,y) E Ri. An association scheme defined as above, is 
sometimes called a symmetric association scheme since all relations are symmetric by 
(iii). Some authors prefer to allow for 'non-symmetric association schemes' by replacing 
condition (iii) by 

(iii') for each i E {0, . . . ,t} there is an z* G {0, . . . , t} such that 

(x, y) E Ri implies {y, x) E Ri* for all x, y G X, 
(iii") pIj = p% for all I, J, /c G {0, . . . , t}. 

(2.68) 

In this thesis we will only use symmetric association schemes. 

The numbers pf ,• are called the intersection numbers of the association scheme. The 
intersection numbers are not free of relations. We mention some obvious relations: 

Pt- = V% (2.69) 

p°j = when i 7^ j. 

The numbers Ui := p?j are called the degrees of the scheme and give the number of points 
that are i- related to a given point (each relation Ri induces an nj-regular graph on X). 
To each relation Ri, we associate the X x X matrix Ai in the obvious way: 

otherwise. 
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The matrices Aq, . . . ,At are called the adjacency matrices of the association scheme and 
allow to study the association scheme using algebraic (spectral) tools. In terms of the 



adjacency matrices, the axioms in (2.68) become 



(i) Ao + Ai + --- + At = J, 
(ii) Ao = /, 
(iii) Ai = AJ for alH G {0, . . . , t}, 



t 



(iv) AiAj = J2 vljAk for any i, j G {0, . . . , t}. 

fc=0 

Let 

A := {xoAo + xiAi + ■■■ + XtAt | xq, . . . , x* G C } (2.71) 

be the linear space spanned by the adjacency matrices. Axiom (iv) says that A is closed 
under matrix multiplication. Since all matrices in A are symmetric, it follows that ^ is a 
commutative matrix *-algebra, which is called the Bose-Mesner algebra of the association 
scheme. Since the adjacency matrices are nonzero and have disjoint support, they are 
linearly independent. This implies that the dimension of A equals t + 1. 

Since the algebra A is commutative, it has a basis Eq, Ei, . . . ,Et oi matrices satisfying 

(2.72) 



(i) 


EiEj = SijEi, 


(ii) 


Eo + ... + Et 


(iii) 


E* = Ei, 



for every i,j G {0, . . . ,t}. The matrices Ei are called the minimal idempotents of the 
algebra and are uniquely determined by A. Geometrically, this means that there is an 
orthogonal decomposition 

C^ = Vo®Vi®---®Vt, (2.73) 

where Ei is the orthogonal projection onto Vi for i = 0, . . . ,t. For each i the dimension 

mi:= dim Vi (2.74) 

equals the rank of Ei. The numbers mo, . . . ,mt are called the multiplicities of the associ- 
ation scheme. 

In general, there is no natural way to order the Ei. However, there is one exception. 
The matrix |X|~^J is always a minimal idempotent, hence it is customary to take Eq : = 
|X|~^J (and Vo = CI, rrio = 1). Since all matrices in A are symmetric, the idempotents 
Ei are real by (|2]72|(iii). 

Since both {Eq, . . . , E^} and {Aq, . . . , A^} are bases for A, we can express every matrix 
in one base as a linear combination of matrices in the other base. The (t + 1) x (t + 1) 
real matrices P, Q are defined as follows: 

t 

^.• = 5]^M^^> (2-75) 

i=0 

t 

l-^l • Ej = y ^QijAi 



i=0 
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for j = 0, . . . , t. The matrices P and Q are called the first and second eigenmatrix of the 

scheme respectively. Indeed, since 

t 

5^qE, (2.76) 

i=0 

has eigenvalue Cj with multiplicity rrii (if the q are different), the i-th column of P gives 
the eigenvalues of Ai. Clearly 

PQ = QP=\X\-I, (2.77) 

but additionally, the matrices P and Q satisfy the following relation 

TTijPj^i = UiQij, for all i,j G {0, . . . ,t}. (2.78) 

In matrix form: 

P"^Diag(mo, . . . , m^) = Diag(no, . . . , nt)Q. (2.79) 

This is a consequence of the fact that both bases {AQ,...,At} and {EQ,...,Et} are 



orthogonal. Indeed, this implies by (2.75) that both the left-hand side and the right-hand 
side in equation (2.78) are equal to {Ai,Ej). 

Given a subset Y C X oi the point set, the distribution vector of Y is the (t + l)-tuple 
(ao, ai, . . . , at) of nonnegative numbers defined by 

a-= |r|-i. |(r xr)ni?i|, i = o,...,t. (2.8O) 

The numbers Oj give the average number of elements in Y that are i-related to a given 
element in Y. In particular ao = 1 and ao + ■ ■ ■ + at = \Y\. Delsarte [15] showed that the 
distribution vector satisfies the following system of inequalities: 

t 
Y,Qi,,ci^>0 forj = 0,...,t. (2.81) 

i=0 

Let K ^ {1, . . . ,t}. A subset S" C X of the point set is called a K-clique if any two 



different elements x,y E K are z-related for some i E K. The inequalities (2.81) yield an 
upper bound on the maximum size of a i^-clique called the Delsarte bound. 

Theorem 3. Let {X,{Rq, . . . ,Rt}) be an association scheme and let K C {l,...,t}. 
Then the maximum size of a K-clique is upper bounded by 

max {ai H h a* | Oq = 1, a^ = /or i G {1,2, . . . , t} \ K (2.82) 

a, > for all z G {1, 2, . . . , t} 



ao, . . . , aj satisfy the inequalities (2.81)}. 



The Delsarte bound can be efficiently calculated using linear programming and often 
gives a remarkably good upper bound. 

One source of association schemes are (permutation) groups. Let G be a group acting 
on a finite set X. Then G has a natural action on X x X given by g{x,y) := {gx,gy). 
The orbits 

{i9x,gy)\geG} (2.83) 
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of X X X are called orbitals. The group G is said to act generously transitive when for 
every pair {x,y) G X x X there is a group element g E G that exchanges x and y, that is 
gx = gy and gy = gx. When G acts generously transitive, the orbitals form the relations 
of an association scheme. 

Indeed, the orbitals partition X x X, for any x E X the orbital {{gx,gx) \ g E G} is 
the identity relation (as G acts transitively on X) and the orbitals are symmetric (since 
G acts generously transitive). Finally, let Ri,Rj,Rk be orbitals and let for {x,y) G Rk 

Z^^y := {2; e X I (x, z) e Ri, {z, y) G i?,}. (2.84) 

We have to show that the cardinality p^ • of Z^y only depends on the relations z, j, k and 
not on the particular choice of x and y. This follows since 

Zgx,gy l^{gz\ze Z.x^y} (2.85) 

for any g E G. In this case, the Bose-Mesner algebra is the centralizer algebra of G. 

Given an association scheme 5* = (X, 7^) with adjacency matrices Af),Ai,...,At G 
C"^^"^, and a point x G X, the Terwilliger algebra of S with respect to x is the complex 
algebra generated by Aq, ..., At and the diagonal matrices E'q, . . . ,E^ defined by 

iEX.-=\l '^ *'"''■ (2.86) 

I U otherwise. 

Observe that E'^^ + ■ ■ ■ + E'^ = I. These algebras were introduced by Terwilliger in |39] 
under the name suhconstituent algebra as a tool for studying association schemes. In this 
thesis we will use the Terwilliger algebra of the Hamming scheme to obtain bounds for 
codes, improving the Delsarte bound. 



Chapter 3 

The Terwilliger algebra of the 
Hamming scheme 



A particular association scheme that plays an important role in the theory of error correct- 
ing codes is the Hamming scheme. In this chapter we will consider this scheme together 
with matrix algebras associated to it. In particular we construct a block diagonalisation 
of the Terwilliger algebra of the binary and the nonbinary Hamming scheme. 



3.1 The Hamming scheme and its TerwiUiger algebra 

Fix integers n > 1 and q >2, and fix an alphabet q = {0, 1, . . . , g — 1}. We will consider 
the Hamming space E = q" consisting of words of length n equipped with the Hamming 
distance given by 

rf(u,v):=|{2|u, 7^v,}|. (3.1) 

For a word u G E, we denote the support of u by S{u) := {i \ Uj ^ 0}. Note that 
|S'(u)| = d{u, 0), where is the all-zero word. This number is called the weight of u. 

Denote by Aut(g, n) the set of permutations of E that preserve the Hamming distance. 
It is not hard to see that Aut(g, n) consists of the permutations of E obtained by permuting 
the n coordinates followed by independently permuting the alphabet q at each of the n 
coordinates. If we consider the action of Aut(g, n) on the set E x E, the orbits form 
an association scheme known as the Hamming scheme H{n,q), with adjacency matrices 
Aq, Ai, . . . ,An defined by 

I otherwise, 

for i = 0,1, . . . ,n. The adjacency matrices span a commutative algebra over the complex 
numbers called the Bose-Mesner algebra of the scheme. 

We will now consider the action of Aut(g, n) on ordered triples of words, leading to 
a noncommutative algebra Aq^n containing the Bose-Mesner algebra. To each ordered 

21 
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triple (u, V, w) G E x E x E we associate the four-tuple 

(i(u, v,w) := {i,j,t,p), where (3.3) 

i := (i(u, v), 
j ■.= d{u,w), 

t := \{i \ Ui ^ \i and Uj ^ Wj}|, 
p:= \{i I Uj 7^ Vi = Wi}|. 

We remark that the case q = 2 is special since in that case we always have p = t. Note 
that (i(v, w) =i+j — t — p and \{i \ Uj 7^ Vj 7^ w.j 7^ Uj}| = t — p. The set of four-tuples 
{i,j,t,p) that occur as (i(u, v,w) for some u, v,w G E is given by 

X(2,n) := {{i,j,t,p) \ < p = t < i, j andi + j < n + t}, (3.4) 

and 

X{q,n) := {{i,j,t,p) \ < p <t < i,j and i + j < n + t}, (3.5) 

for g > 3. The sets I{q,n) will be used to index various objects defined below. 
Proposition 10. Let n > 1. We have 

f(T) forq = 2, 
\I{q,n)\l (3.6) 

[(T) forq>3. 

Proof. Substitute p' := p, t' := t —p, i' := i — t and j' := j — t. Then the integer solutions 
of 

0<p<t<i,j, i + j<n + t (3.7) 

are in bijection with the integer solutions of 

0<p',t',i',j', p' + t' + i' + f <n. (3.8) 

The proposition now follows since 

|{(ni,n2,...,nfe) G Z>o I ni H ^ Uk = n}\ = i j. (3.9) 

D 

The integers i,j,t,p parametrize the ordered triples of words up to symmetry. We 
define 

Xij,t,p ■= {(u,v,w) G E X E X E I d{u,\,w) = {i,j,t,p)}, (3.10) 

for {i,j,t,p) G X(g,n). The meaning of the sets Xij^t,p is given by the following proposi- 
tion. 

Proposition 11. The sets Xij^t,p, {hJ-,tiP) ^ 2r(g, n) are the orbits o/E x E x E under 
the action of Aut{q,n). 
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Proof. Let u, v,w G E and let {i,j,t,p) = (i(u, v,w). Since the Hamming distances 
hJii+3~i~V and the number t — p = |{i | Uj 7^ Vj 7^ Wj 7^ Ui}| are unchanged when 
permuting the coordinates or permuting the elements of q at any coordinate, we have 
(i(u, V, w) = (i(7ru, vrv, vrw) for any tt G Aut(g, n). 

Hence it suffices to show that there is an automorphism vr such that (vru, vrv, vrw) only 
depends upon i,j, t and p. By permuting q at the coordinates in the support of u, we 
may assume that u = 0. Let A := {i \ \i ^ 0,Wj = 0}, B := {i \ \i = 0,Wj 7^ 0}, 
C := {z I Vj 7^ 0, Wj 7^ 0,Vj 7^ Wj} and D := {i | Vj = Wj 7^ 0}. Note that 1^41 = i — t, 
\B\ = j — t, \C\ = t — p and \D\ = p. By permuting coordinates, we may assume that 
A = {l,2,...,t-t},B = {t-t+l,...,i+j-2t},C = {i + j-2t + l,...,i + j-t-p} 
and D = {i+j — t — p+l,...,i + j — t}. Now by permuting q at each of the points in 
AUBUCUD,we can accomplish that \, = 1 ior i e AUCUD and Wj = 2 for i G -B U C 
and Wj = 1 for i ^ D. D 

Denote the stabilizer of in Aut(g, n) by Auto(g,n). For {i,j,t,p) G X{q,n), let M-'J 
be the E x E matrix defined by: 



1 if|5(u)|=z, |5(v)|=j, |^(u)n5(v) 



{Ml'JU,,:={ |{^|v, = u,^0}|=p, (3.11) 



otherwise. 
Let Aq^n be the set of matrices 

E ^';'Kf' (3-12) 

(«J,t,p)eX(g,n) 

where x/^ G C In the binary case, we will usually drop the superfluous p from the 
notation and write x* ,■ and Mf ,. 



From Proposition 11 it follows that Aq^n is the set of matrices that are stable under 
permutations tt G AvXo{q,n) of the rows and columns. Hence Aq^n is the centralizer 
algebra of Auto(g, n). The M^J constitute a basis for Aq^n and hence 

m if? =2, 

dim^,,„ = { (3.13) 

(T) if?>3, 



by Proposition 10 , Note that the algebra Aq^n contains the Bose-Mesner algebra since 



(«,j,t>p)6X(9,n) 
i+j—t—p=k 

We would like to point out here, that Aq^n coincides with the Terwilliger algebra (see 
j) of the Hamming scheme -ff (n, q) (with respect to 0). Recall that the Terwilliger alge- 
bra Tq^n is the complex matrix algebra generated by the adjacency matrices Aq, Ai, . . . , A„ 
of the Hamming scheme and the diagonal matrices Eq, E[, . . . , E'^ defined by 

/p,N fl if|5(u)|=i, 

(^Ju,u ■■= <^ ^- . (3.15) 

U otherwise, 
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for i = 0, 1, . . . ,n. 

Proposition 12. The algebras Aq^n o-nd Tq^n coincide. 

Proof. Since Aq^n contains the matrices Aj. and the matrices E'^, = M^'^ for A; = 0, 1, . . . , n, 
it follows that Tq^n is a subalgebra of Aq^n- We show the converse inclusion. In the case 
g = 2 this follows since 



M„ 



t,t 



EiAi+j^2tEj, 



as is readily verified. We concentrate on the case g > 3. Define the zero-one matrices 
Bi, d, Di e Tq,n by 

(3.17) 



B. 


■■= E[A,E[, 


c. 


■■= E[A,E[^,, 


D. 


■■= E[A,EU 



Observe that: 

(-Bi)u,v 
(^j)u,v 
(A)u,v 



1 if and only if 

\S{xx)\ = ^,d{u,^r) = 1, |5(v)| = t,S{u) = 5(v), 
1 if and only if 

\S{u)\ = ^,d{u,^r) = 1, |5(v)| = ^ + 1, \S{u)AS{^r)\ 
1 if and only if 

\S{u)\ = ^,d{u,^r) = 1, \S{^r)\ =t-l, |5(u)A5(v)| 

t,p 



(3.18) 



1. 



For given {i,j,t,p) G X(g, n), let A-'^ G 7^,„ be given by 



A^^ := (AA-i ■ ■ ■ Dt^^)iCtCt+l ■ ■ ■ Q-i)(5, 



\t-p 



(3.19) 



Then for words u, v G E, the entry (A/^)u,v counts the number of (i + j — t — p + 3)-tuples 



hj 



u = di, di_i, . . . ,dt = Ct, Ct+i, . . . , Cj = bo, . . . , b 



t-p 



(3.20) 



where any two consecutive words have Hamming distance 1, the b^ have equal support 
of cardinality j, and [^(dfc)! = k, |S'(cfc)| = k for all k. Hence for u, v G E the following 
holds. 



(^jj)u,v 



if d{u, \)>i + j — t — p or 
|5(u)A5(v)| >i + j-2t 



(3.21) 



and 



(4'^^),,v >0 if|5(u)|=^, |5(v)|=j, 

d{u, v) =i+j~t — p and 
|5(u)A5(v)| =i + j-2t. 



(3.22) 
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Equation (3.21) follows from the triangle inequality for d and (i'(x,y) := |S'(x) fl 5'(y)|. 
To see (3.22 ) one may take for d^ the zero-one word with support {i + l — k, . . . ,i}, for c^ 
the zero-one word with support {i + l — t,...,i + k — t} and for b^ the word with support 
{i + 1 — t, . . . ,i + j — t} where the first k nonzero entries are 2 and the other nonzero 
entries are 1. 

Now suppose that Aq^n is not contained in Tq,n, and let M-'J be a matrix not in Tq^n 
with t maximal and (secondly) p maximal. If we write 

4' = E<f'K/' (3.23) 

t',p' 



then by (3.21 ) x^f = ii t' + p' < t + p oi t' < t implyi ng th at A*'^ - a;-;^M*j' G 7^,„ by 



the maximality assumption. Therefore since x/^ > by (3.22), also M^'f belongs to Tq 



«j 



rt,P 



hj "^'■'-'^'■S^ ^^ 'q,ni 



a contradiction. D 

3.2 Block diagonalisation of An 

A block diagonalisation of An '■= A2,n was first given by Schrijver in [38]. In this section 
we will describe this block diagonalisation. In the next section we will use it to describe 
a block diagonalisation of Aq^n for general q. 

Let ra be a fixed positive integer and let V = Vn denote the collection of subsets of 
{1, . . . , n}. It will be convenient to identify binary words with their supports (as elements 
of V). We will use capital letters to denote sets. For convenience, we use the notation 



that is 



C, := MIZL (3.24) 



(C^ -i^ if|X| = z-l,|F|=z,XCF, 
U otherwise 



for 2 = 0, ... n. In particular observe that Co is the zero matrix. The matrices Ci, . . . , C„ 
and their transposes Cj, . . . , Cj play a prominent role in the block diagonalisation of An- 
They generate the algebra An as can be easily seen from the identities 

a+iC7+i - CjC, = {n- 2i)E[ (3.26) 

and 

n 

Y,{C. + CJ) = M,. (3.27) 

i=0 

Indeed, the adjacency matrix Mi of the Hamming cube generates the Bose-Mesner algebra 
of the Hamming scheme. Since / = 'Y^^=i E'i is in the Bose-Mesner algebra it follows 



by (3.26) that also the diagonal matrices E[, . . . , E'^ are in the algebra generated by 

^Ij ■ ■ ■ J <-^n; '-'1 ) • • • ; <-'„ • 

For k = 0, . . . , [|J , define the linear space Lk to be the intersection of the space of 
vectors with support contained in the collection of sets of cardinality k, and the kernel of 
Ck'- 

Lk := {6 G M^ I Ckb = 0, 6x = if |X| ^ k}. (3.28) 
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Proposition 13. For each k < [|J the dimension of Lk is given by 

Proof. It suffices to prove that Ck has rank {jj^i)- This follows since for any nonzero 
X G M'^ with xi = when \I\ ^ k — 1, we have CkX ^ 0. Indeed 

x'^CkC'^x = x'^C'^^^Ck-iX + {n-2k + 2)x'^x > (3.30) 



by (3.26). D 



Before giving an explicit block diagonalisation, we will ffist sketch the basic idea. Let 
b E Lk he nonzero and consider the vectors b^, bk^i,bk+2, ■ ■ ■, where b^ := b and 

bi+i := Cj+i ■ • • Q.+2Cfc+i^ (3.31) 



for i > k. It can be shown (see Proposition 15 below) that 



1 

(n — 2k\ 2 
^_^j {t-k)l (3.32) 

It follows that bi is zero for i > n — k and nonzero for i = k, . . . ,n — k. Since the bi have 
disjoint support, bk, ■ ■ ■ , bn-k are an orthogonal basis for the linear space H they span. 



From (3.26) it follows that 

C,+i6,+i = Q+,Cl,k = {n- 2i)h + Cj{CA) (3.33) 

and hence, since Ckbk = 0, that 

i 

C,+^h+^ = h-Y,{n-2s). (3.34) 

s=k 

The space Vb is thus mapped to itself by each of the C, and Cj and hence by every 
M e An- The action of An restricted to H is determined by 

n—k i 

Ci+i(Y^Xjbj) = Xi+iC^{n - 2s))bi (3.35) 

j=k s=k 

n—k 

j=k 

and does not depend on the particular choice of 5 G Lk, but only on k. If we take for 
each k an orthonormal basis of Lk and let b range over the union of these bases, we will 
obtain a decomposition of M'^ as a direct sum of orthogonal subspaces Vf,. This yields a 
block diagonalisation of An, where for each k there is a block of multiplicity dimLfc. In 
order to obtain a formula for the image of M * • in each of the blocks, we need to express 
M* • (as a polynomial) in the matrices Ci and Cj. 
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We will now give a detailed proof, see also ^8\ . We begin by giving a convenient way 
to express the matrices M^j in terms of Ci, . . . , C„, Cj, . . . , Cj. A first observation is that 

{k - i)Ml^ = Ml^_^Ck for all i < k. (3.36) 

An important consequence is that 

Ml J) = for al\i<k and & G L^. (3.37) 

Secondly, we have the following identity. 

Proposition 14. For all l,k,p & {0, . . . , n} .' 

M[,k = E(-1)^""(')m!.^.%- (3.38) 

Proof. The entry of 

i2(-^y~H'}jKsKk (3.39) 

in position (X, Y), with |X| = k, \Y\ = I and |X fl F | = t, equals 

ti'H'^d) - B-D'-'QC::) (3-40) 






This last sum equals zero ii t y^ p and equals 1 if t = p. D 

The following proposition gives the inner products between vectors of the form MKb, 
where b E L^. These will be used to construct an orthonormal basis with respect to which 
the algebra is in block diagonal form. 

Proposition 15. For i,j, A;,/ G {0, . . . ,n} with k, I < [|J, and for c E Li,b E L^: 

I otherwise. 

Proof. Clearly MJ^MK = ii i ^ j, hence we may assume i = j in the remainder of the 



proof. By (3.38) we have ior < p < k, I: 



z^Mff^b = {^ ' ^P^ (3.42) 

' ' otherwise, 
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since c^M^^ = ioi s ^ I and M|^6 = for s 7^ A;. Hence we obtain 

p=0 ^ ^ 



K.rr-_f\cn. 



Define for i, j, k,t E {0, . . . ,n} the number 






n-2k 
i — k 



E(-i) 



k-p f^\ fi — p\ fn + p — i — k 



p=Q 



pj \t — pj \n + t ~ i — j 



These numbers will be used to describe the block diagonalisation. 
Proposition 16. For i,j, k,t E {0, . . . ,n} with k < |_|J , and for h E L^: 

'n - 2k^ 



i — k 



Ml^Ml,b = l3l,,Mt^,b. 



Proof. By (3.38), it follows that for < p < n: 



Ml,b 



-D'^-^f jMi6. 



This implies that 



KMkb 



Z-.yt-pJ\n + t-t-jl ^'' 

^Q\t-p)\n + t-i-i 



k—p 



P 



Mhh. 



This proves the proposition. 



(3.43) 



D 



(3.44) 



(3.45) 



(3.46) 



(3.47) 



D 



We will now describe the block diagonalisation. For each k = 0, . . . , [|J , choose an 
orthonormal basis B^ of the linear space L^. By Proposition 13 we know that |i?fc| = 
a)-(.-i)-Let 



V:={{k,b,i) \ke{0, 



n 
L2. 



},be Bk,i e {k,k + l,...,n- k}}. 



(3.48) 
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Then 

= y( "" \ = y(''\=2\ 

■^^ \niin|2,n — iW ^-^ \% I 
Define for each (A;, 6, -j) G V the vector Uk,b,i £ I^'^ by 

^^.-(''rf) ^M^,b. (3.50) 



If follows from Proposition 15 and |V| = 2" that the vectors Uk,b,i form an orthonormal 
base of M'^. Let U be the V xV matrix with Mfc,6,j as the {k, b, i)-th column. We will show 
that for each triple i,j,t the matrix 

Ml^ := U^Ml^U (3.51) 

is in block diagonal form. Indeed we have 

Proposition 17. For {l,c,i'), {k,b,j') G V and i,j,t G {0, . . . ,n}.- 

{Mlj){i,c,i'),{k,b,j') = I (3.52) 

[_0 otherwise. 

Proof. We have 

^v_J M^M^^b (3.53) 



1 



1 1 



-1 



n-2k\ ^/n-2k^ 
j-k) \i-k ^ ^*'^'' 






Since 



iMl)(i,c,n{k,bj') = ^Ici'MljUk^bj' (3.54) 

the proposition follows. D 

Proposition 18. The matrix U gives a block diagonalisation of An. 



Proof. Proposition 



17 



implies that each matrix M* • has a block diagonal form, where for 
each /c = 0, . . . |_|J there are Q) — {^^ copies of an (n + 1 — 2k) x (n + 1 — 2k) block on 
the diagonal. For each k the copies are indexed by the elements of Bk, and in each copy 
the rows and columns are indexed by the integers i G {fc,A; + l,...,n — A;}. Hence we 
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need to show that all matrices of this block diagonal form belong to U AnU . It suffices 

I ^ I 
to show that the dimension X]fc=o (^ + 1 ^ 2/:;)^ of the algebra consisting of the matrices 

in the given block diagonal form equals the dimension of A^ which is ("^ ) . This follows 

by induction on n from 



n + 3 
3 



Remark 1. Since 



n 



2) 
3 



n + l 
1 



n + l 
2 



in + l] 



{Ml 



u\Miyu 

U^MlJJ 



(3.55) 

D 

(3.56) 



j,«' 



if follows from Proposition 11 that Plj^ = Pjik, which is not obvious from the definition 



of Plj^i^ . In IWj, Proposition 
different expression for /3* ■ f^ 



11 is derived in a slightly different manner, resulting in a 
much displays the symmetry between i and j : 



A* 



i,j,k 



u=0 



\u—t 



n-2k 
u — k 



n — k — u 
i — u 



n — k — u 
j -u 



(3.57) 



3.3 Block-diagonalisation of Aq^n 

In this section we give an explicit block diagonalisation of the algebra Aq^n- The block 
diagonalisation can be seen as an extension of the block diagonalisation in the binary 
case as given in the previous section. There the binary Hamming space was taken to 
be the collection of subsets V of {1, . . . ,n}. Now it will be convenient to replace this 
by the collection of subsets of a given finite set V. Let ^ be a finite set of cardinality 
\V\ = m. By V{V) we denote the collection of subsets of V. For integers i,j, define the 
PiV) X ViV) matrix C^j by 



{Cij)i,j 



1 if|/|=i, |J| 
otherwise. 



j, / C J or J C /, 



(3.58) 



The matrices CYi^ correspond to the matrices M^j^ from the binary Terwilliger algebra. 

For 

k 



We have renamed them in order to avoid confusion with the matrices M^'J G Aq^n- 
0, . . . , [yj define the linear space L^ by 



v 



{x e C^(^) I C, 



V 

fc-l,fc 



X 



0, xi = if \I\ ^ k}, 



(3.59) 



and let B^ be an orthonormal base of L^. For z, j, k,t G {0, . 



m,t 
k 






m — 2k 
i — k 



m 
p=0 



\k-p 



I ~ p 
t — p 



m}, define the number 
(3.60) 



m + p — i — k 
m + t — i — j 



We recall the following facts. 
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Proposition 19. Let V be a finite set of cardinality m. Then 
(i) Forke{0,..., [fj} we have 

_y I in\ I m 



•^""^'-U. -U-il- <^«^' 



V 



(a) For i,k,l e {0,...,n} with k, I < [f\, and for b E L^ , c E L] 

(C^.^crO=|r"'^''^ 1^^^' (3.62) 

lU otherwise. 

(izz) For i,j, k,t E {0, . . . ,n} with k < [^\, b E Ll andY CIV with \Y\ = j 

E (0)!^ = ^5i("':fy'(C]ii)v. (3.63) 

\U\=i 

\unY\=t 



Proof. Items (i),(ii) and (ill) follow directly from Propositions 13, 15 and 16 D 



We will now describe a block diagonalisation of Aq^n- Let G C be a primitive 
(g — l)-tli root of unity. Let 

V := {{a,k,i,si,b) \ (3.64) 

a, k, i are integers satisfying 0<a<k<i<n + a — k, 
a G q"" satisfies |5'(a)| = a, a/^ 7^ g — 1 for /i = 1, . . . , n, 



where U := {1,2, ... ,n}\U for any set f/ C {1, 2, . . . , n}. For each tuple (a, k, i, a, b) in 

.a,k,.' 
a,b 

,a,k,i\ 



V, define the vector ^E'"^'* G C" by 



' (9 - l)-^^rr-"/')"'<^^^'''H^S-a^)^(x)\5(a) if ^(a) C ^(x), (3.65) 

otherwise, 

for any x G q". Here the nonnegative integer (x, y) is given by 

n 

(x,y) := ^x^y;, (3.66) 



h=0 



for any x, y G q". We stress that (x, y) is not taken modulo q. Observe that (^E'a b'')x = 
if |5(x)| ^i. We have: 



Proposition 20. The vectors ^E'^'f,'*, {a,k,i,a.,b) E V form an orthonormal base of C^" . 
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a,k,i 



Proof. First, the number |V| of vectors ^I/^f,' equals g" since: 



E 



a,k,i 
Q<a<k<i<n+a—k 



n 



{q-2r 



n — a 
k — a 



n — a 
k — a — 1 



rmn{i,n+a—i) 

EEClto-^)" E 



i=0 a=0 

n i 



k=a 



n — a 
k ~ a 



n — a 
k — a — 1 



EE(:)fe-r( 

j=0 a=0 ^ ^ ^ 

n y s i 

E(')E(«-2r 

i=0 ^ ^ a=0 

e(:)(^ -!)'-» 

i—n \ / 



n — a 
niin{z — a,n — i} 



(3.67) 



Secondly, we calculate the inner product of ^I/g^'* and ^I/a/'f,/'* • li i j^ i' then the inner 
product is zero since the two vectors have disjoint support. So we may assume that i' = i. 
We obtain: 



,a,k,i ,T,a' fe 






n + a - 2A;\ ^ fn + a' - 2k' 



i — k J \ i — k' 

Ej(a,x)-(a',x)/^S'(ay ,n /^S(a') ,/n 

^ H<-'i-a,fc-a»J5(x)\S(a) ■ m_a',fc'-a'»J5'(x)\5(a')> 



(3.68) 



where the sum ranges over all x G q" with |5'(x)| = i and 5'(x) 3 S'(a) U S'(a'). If a^ 7^ a'- 
for some j, then the inner product equals zero, since we can factor out X^^^^ (jy^^^^^'^j' = 
0. So we may assume that a = a' (and hence a = a'), which simplifies the right-hand side 

of (Is^esl to 



n + a- 2k\ ^ fn + a- 2A;'^ 2 



/^5(a) ,NT^S{a) ,/ 

i-k I V i-k' ' ^ i-o-,k^<^^ '-'i-a,k'-a" ' 

Indeed, since a' = a, we observe that 

^(a,x>-(a,x> _ ^^ 

and hence the summand only depends on the support of x. We obtain 

Yl ('^^a,fe-a^)5(x)\5(a) " (C,^a,fe/_a^')5(x)\S(a) 

|5(x)|=iJ(x)35(a) 

X 
\X\=i,XDS{a) 



(3.69) 
(3.70) 



(3.71) 



yC5(a) 
\Y\=i-a 
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From equation (3.69) and Proposition 

ii k = k' and b = b' , in which case the inner product equals 1 



19 



we conclude that ( '^"^^^ ., ^ab'* ) ^^ nonzero only 



D 



The block diagonalisation will follow by writing the matrices M^ ■ with respect to the 



,a,kA 



«J 



new orthonormal basis of C formed by the vectors ^l/a'f,' • To this end we define for 
i,j,t,p, a,k E {0, . . . ,n} with a < k < i,j the number 



a(,t,j,t,P,a,k) :=0:::'::.M - 1)'<'+''-' 



ff=0 ^^^ 



t — a 
p-9 



{q - 2)*-"-P+^. 



(3.72) 



We obtain the following. 

Proposition 21. For {i,j,t,p) GX{q,n) and {a,k,i',a,b) &V we have: 






5iA' 



n + a — 2k 



^ fn + a-2k 
j -k 



a,k,j 



a{i,j,t,p,a,k)^''^f^ 



(3.73) 



Proof. Clearly, both sides of (3.73) are zero ii i ^ i', hence we may assume that i 



We calculate (^,-,f^ab'*)y ^^ '^^y assume that |5'(y)| = j, since otherwise both sides 



of (3.73) have a zero in position y. We have: 



t,p.j,a,k,i\ 



(Mf^ 



xeq" 



(3.74) 



(q-iy 



I,- fn + a — 2k\ ^ 
i — k 



E<^^"^^(^2i-^)^w\^( 



a), 



where the last sum ranges over all x G q" with |5'(x)| = i, S'(x) 3 S{sl), |5'(x) fl 5'(y)| = t 
and \{h | x/, = y/, 7^ 0}| = p. 
We will work out the sum: 



E 



0^^'^^(CfJ:La&)s(x)\5(a). 



(3.75) 



|5(x)|=i,S(x)25(a) 
|5{x)nS(y)|=t 
|{/i|xh=yh7^0}| 



If there exists an /i G ^(a) \ S'(y), we can factor out ^11=1 'P^^'^ — O5 implying that 



both sides of (3.73) have a zero at position y. Hence we may assume that ^(y) ^ S'(a). 



Now the support of each word x in this sum can be split into five parts f/, f/', V, V, W, 
where 



U = {/i G ^(a) I X, = y,,} 

U' = 5(a) \t/, 

V = {/i G S(y) \ 5(a) I X, = y,}, 

V = ((S(y) \ 5(a)) n S(x)) \ V^, 
W = 5(x)\5(y). 



(3.76) 
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Setting g := \U\, gives \U'\ = a — g, \V\ = p — g, \V'\ =t — a — p + g and \W\ = i — t. 
Hence splitting the sum over g, we obtain: 

p 

/ . / . {Ci-l,k-a^)vuV'UW 

g=0 u,u',vy',w 

n '^"^'"' n (-<^''''') n 1 n (^ - 2) n (? - 1)> i^-^v 

heu heu' hev hev hew 

where U, U', V, V, W are as indicated. Substituting T = V UV U W, we can rewrite this 
as 



g(:)C::)<-^'"""-^''-^-'" 



(, _ ly-'^M J2iCf%.,b)r, (3.78) 



T 



where the sum ranges over all T C ^(a) with \T\ = i — a and \T fl >S'(y)| = t — a. Now by 
Proposition ITotiii), this is equal to 



(^ - 1)'"* E Q Q ""J i-^r'ii - 2)*-»-^+^- 



,=0 -^ ^p-9 



^^'^^ ^^" ";. 1 ^'^) "/3r;r:.-.(^S-.^)-(y)\5(a)> (3-79) 



which equals 



,a,k,j\ nn—a,t~a f 71 -\- a ZK 



(^S'^)y ■ ^IPl^-A ^ _ ^ ) (^ - 1)^-'^^^- 



i:(-ir^Q(^_^)(^-2)*-'^-^^. (3.80) 



This completes the proof. D 

If we define V to be the q" x V matrix with ^1/^';,'* as the (a, fc, i, a, 6)-th column, then 



Proposition 21 shows that for each {i,j,t,p) G I{q,n) the matrix M^'j' := U*M-'^U has 
entries 



(-^i,j )(a,fc,/,a,6),(a',fc',«',a',fe') 
1 



n-fc'')''^-"/') '(y{i,J,t,p,a,k) ifa = a',fc = fc',a = a',6 = 6'and 

l = t,r=j, (3.81) 

otherwise. 



This implies 
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Proposition 22. The matrix U gives a block diagonalisation of A, 



q,n' 



Proof. Equation (3.81 ) implies that each matrix M^^ has a block diagonal form, where for 
each pair (a, fc) there are Q)(g-2)'' [(^I") - („"~"i)] copies of an (n+a+l-2fc) x (n+a + 
1 — 2/c) block on the diagonal. For fixed a, k the copies are indexed by the pairs (a, h) such 

that a G q" satisfies |5'(a)| = a, a^ 7^ g — 1 for h = 1, . . . , n, and h G B^_^. In each copy 
the rows and columns in the block are indexed by the integers i with k<i<n + a — k. 
Hence we need to show that all matrices of this block diagonal form belong to U*Aq^nU. 
It suffices to show that the dimension J2o<a<k<n+a~ki^ + a + 1 — 2kY of the algebra 
consisting of the matrices in the given block diagonal form equals the dimension of Aq^n, 
which is ("^^ ) . This follows from 

Y^ {n + a + l-2kf 

0<a<k<n+a— k 

I 71 — a I 

n L— 2~J 

a=0 k=o (3.82) 

E/n — a + 3^ 

a=0 ^ 

n + 4\ 
4 I 

D 

This implies the following result. 

Theorem 4. The matrix 

M= Yl <'Xj (3-83) 

is positive semidefinite if and only if for all a, k with 0<a<k<n + a — k the matrices 

\ n+a—k 

^a(z,j,t,p,a,/c)x-'n (3.84) 

*'P / i,j=k 

are positive semidefinite. 

Proof. The matrix M is positive semidefinite if and only if U*MU is positive semidefinite. 



Since U*MU is in block diagonal form, where the blocks are exactly the matrices in (3.84) 



each with multiplicity at least one, the theorem follows. D 

Theorem 5. The matrix 

R { E ^*"<n (3-85) 

\(ij,t,p) / 



36 CHAPTER 3. THE TERWILLIGER ALGEBRA OF H{N, Q) 

is positive semidefinite if and only if for all a, k with 0<a<k<n + a — k and k ^ 
the matrix 

\ n+a—k 

^a(i,j,t,p,a,A;)x.'n (3.86) 



is positive semidefinite, and also the matrix 



1 x'^ 
X L 



is positive semidefinite, where 



and 



Proof. Let 



Observe that 



I ' - ■ ''' 



Since 



n 



Y^{Qfl,i,0,l) 



j=0 



(3.87) 



L:= |5^a(^,j,t,p,0,0)x*;n (3.88) 



Xi=C.\{q-lY-xYi fort = 0,...,n. (3.89) 



M ■■= E <''<f (3-90) 



1 ^\\(M)f^ ^\-f ^ {dmg{M)yU\ 

UJ ^^^^^Mo f/ J - I f/*diag(M) U*MU '' ^"^-^^ 



:(,-ir*r- : 



X-*'°' = ( , I (X - l)'*Sf = ( , I (X - l)-C(o,o,.A.) (3.92) 

and U*U = I, we see that 

f/*diag(M) = f/*5]]a:^;k^'^°^ (3-93) 

j=0 

= i:<s (I) («-!)■ 



has nonzero entries only in the block corresponding to a = A; = 0. The theorem follows. 

n 
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3.4 The Terwilliger algebra of the Johnson scheme 

The Hamming scheme is a natural and powerful tool in studying subsets of the binary 
Hamming space with prescribed distance relations. In particular, the Delsarte bound 
gives good upper bounds on the size of a code. In the case of constant weight codes, one 
considers subsets of the Johnson space, consisting of the subsets of some fixed size w. 
Now the appropiate tool to use is the Johnson scheme. 

Let w < n he positive integers and let P^ be the collection of subsets of {1, . . . ,n} 
of cardinality w. So Vn is the disjoint union of P°,'P^,. . . ,7^^. We will assume that 
w < |_|J. This is not a severe restriction since V^ and Pn~^, with the Hamming dis- 
tance, are isomorphic. This is because the Hamming distance is preserved under taking 
complements: d{U, V) = d{U, V) for sets U,V E {I, . . . ,n}. It is convenient to define the 
Johnson distance dj by 

dj{U,V):=w-\UnV\ = ^d{U,V). (3.94) 

We denote by Aut{n,w) the set of automorphisms of the Johnson space. It is easy to 
see that the automorphisms are just the permutations of V^ induced by permuting the 
ground set {1, . . . , n}. The distance relations Rq, . . . , R^ given by 

Rd := {{U, V)eV:xV:\ dj{U, V) = d} (3.95) 

are precisely the orbits under the action of Aut{n,w) on V^ x V^'- 

Ra = {(aU, aV)\a E Aut(n, w)}, when dj{U, V) = d. (3.96) 

Hence Rq, . . . , i?^, form an association scheme called the Johnson scheme J{n,w). The 
Bose-Mesner algebra of the Johnson scheme is spanned by the matrices Ad G C'^" ^'^" , 
d = 0, . . . ,w given by 

(A) -/^ iidj{U,V) = d 
I (J otherwise 

Like in the case of the Hamming scheme, it useful to consider the refinement of the 
Johnson scheme obtained by replacing the full symmetry group Aut(?T,, w) by the stabilizer 
subgroup Autvy(n, w) of some arbitrary element W G V^. Therefore we fix some W G V^- 
Consider the complex algebra T spanned by the 0-1 matrices M^'- where < s < i,j < w 
and t < w — i,w — j, given by 

ii\unw\ = i,\vnw\=j, 
{M-j)u,v:={ \unvnw\ = s,\unv\w\=t. (3.98) 

otherwise 

It is not hard to verify that supports of the matrices M^'- correspond to the orbits of 
V^ X Vn under the action of Autvi/(n, w). The algebra T is in fact the Terwilliger algebra 
of the Johnson scheme J{n, w) with respect to W. 

We will give a block diagonalisation of the Terwilliger algebra of the Johnson scheme. 
This was implicit in the work of Schrijver (|38j). 
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algebra A^, 



:= Aw ® An~w be the tensor product of the algebras A^ and An~ 
consists of the matrices 



where x 



t,t' 



V X*'*' M* 

I E C From Section 



3.2 



/Vf* 

■n—w;i',j'i 



we obtain matrices U,,, and f/„ 



The 



(3.99) 



and Uj_,„An-wUn-w are in block diagonal form. It follows that U : 



diagonalises A 



such that U^AujUw 
= U,„ ® U„-,i, block 



w,n—w smce 

U'^ A U = U'^A U 






It follows from Proposition 17 that the blocks are indexed by the pairs 



UJ 

(/c,A;')e{0,l,..., - }x{0,l,... 



w 
2 



n — w 



}• 



(3.100) 



(3.101) 



For each such pair (fc, k') we have a block -Bjfc^fc', consisting of all (\4 x Vl') ^ (^ ^ ^fc') 
matrices, where Vk := {A;, . . . ,w — k} and V^'; := {A;', . . . ,n — w — A;'}. The image of (3.99) 
in block (A;, A;') is given by 



Et,t' nw,t nn-w,t' \ fw-2k\ (w-2k\ (n-w-2k'\ (n-w-2k'\ 

^i,j,i',j'Pi,j,k ■ Pi'j',k' [ i-k )\ j-k ) \ i'-k' ) \ j'-k' ) 



(3.102) 



t,t' 






By extending each matrix in T by zeros to a P^ x Vn matrix, and identifying Vn with 
'Pw X Vn-w (by identifying U and {UnW, U\W) for any U G Vn), the Terwilliger algebra 
T can be seen as a subalgebra of Aw,n-w, where M^j is identified with M^j (g) Ml^_^^^_-. 
It follows that in the block diagonalisation given above, T is mapped in block (A;, k') to 
those matrices that have nonzeros only in positions with rows and columns indexed by 
{{i,w — i) I i G Vk,w — i G V^,}. Hence restricting each block to those indices, we obtain 
a block diagonalisation of T with blocks of size 

|{A;, . . . , n - A;} n {2u; - n + A;', . . . , u; - A;'}| (3.103) 

for each pair (A;, k') with k + k' < w. This was used in [38] to obtain bounds on constant 
weight codes. 

In the nonbinary case, let E be the set of g-ary word of length n and weight w, equipped 
with the Hamming distance. The g-ary Johnson scheme Jq{n,w) has adjacency matrices 
Mi 
group of E: 



t^p for < p <t < w given by the orbits of E x E under the action of the automorphism 



(Mt,,; 



x,y 



1 if|^(x)nS(y)|=t,|{z|x,, = y,^0}| 
otherwise. 



P, 



(3.104) 



Replacing the full automorphism group by the stabilizer of some word w G E we obtain 
an algebra containing the Bose-Mesner algebra of the nonbinary Johnson scheme which 
may serve to improve bounds for constant weight codes in the nonbinary case. We do not 
know if this is the Terwilliger algebra (with respect to w) of the nonbinary Johnson scheme 
Jq{n,w). The algebra is a subalgebra of a tensor product of Aq^n-w and an algebra of 
dimension ('"^ ) (or ('"^ ) if g = 3). It would be interesting to find a block diagonalisation 
of this algebra. 



Chapter 4 

Error correcting codes 



Given a code C C E := g*^, the minimum distance of C is defined to be the minimum 
of {(i(u, v) I u 7^ v,u,v G C}. The maximum cardinahty of a code with minimum 
distance at least d is denoted by Ag{n, d). In this chapter we give new upper bounds on 
Aq{n, d) based on a semidefinite programming approach, strengthening Delsarte's hnear 
programming bound. For more information on coding theory, the reader is referred to 



4.1 Delsarte's linear programming bound 

Given a code C C E, the (n + l)-tuple (xq, Xi, . . . , x„) defined by 

X, := \C\-' ■ |{(u, v) eCxC\ rf(u,v) = 2}| (4.1) 

is called the distance distribution of the code C. For each i the number Xj equals the 
average number of code words at distance i from a given code word. Observe that Xq = 1 
and xq + Xi + ■ ■ ■ + Xn = \C\. The key observation that leads to the linear programming 
bound is that the following inequalities hold: 

n 

Y^ XiKj{i) > for all J = 0, ... , n, (4.2) 

where 

ir,(x):=|:(-l)'=Q(^:^)(g-l)-^ j = 0,...,n (4.3) 

are the Krawtchouk polynomials. These inequalities give rise to the following linear pro- 
gramming bound on the size of a code with minimum distance at least d: 

n 

Aq{n, d) < max{^ Xj | xq = 1, Xi, . . . , x„ > 0, (4.4) 

i=0 

xi = • • • = Xd-i = 0, 



the Xi satisfy (4.2)}. 
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This approach turned out to be very powerful. Many of the best known upper bounds on 
Ag{n, d) are obtained using this method. 



A proof of the vahdity of (4.2) can be found for example in [T^ 1^ . To illustrate the 
semidefinite programming approach in this chapter, we sketch a proof here. For any code 
C C E, we denote by Mc the 0-1 matrix defined by 

/, ^ X I 1 if u, V G C ,, ^, 

{McU^:={ ' . . (4.5) 

otherwise 



We prove (4.2) 



Proof. Consider the matrix 

1 v^ 

|Aut(g,n)|-|C| 



^■= iA,../„ ^M 1^1 Y. ^-c- (4-6) 



o-£Aut{g,n) 

The matrix M is an element of the Bose-Mesner algebra of the Hamming scheme and the 
coefficients with respect to the adjacency matrices Ai of the Hamming scheme reflect the 
distance distribution: 

n 

M = Y,x,^-^A, (4.7) 

1=0 

where 

7.:=g"(g-irQ (4.8) 

is the number of nonzero entries of A^. Indeed, we have (y4j,Mo-c) = |C'|xj, and hence 
(Aj, M) = Xi for every i = 0, . . . , n and every a G Aut(g, n). 

The matrix M is a nonnegative combination of the positive semidefinite matrices 



M^c and is therefore positive semidefinite itself. The inequalities (4.2) will follow from 
this semidefiniteness by diagonalising the Bose-Mesner algebra. Let the unitary matrix 
U G C^^^ be given by 

(t/)u,v := g-"/^^"'"^ (4.9) 

for u, V G E, where </> is a primitive g-th root of unity. It is a straightforward calculation 
to show that for each i = 0, . . . , n the matrix Ai := U*AiU is a diagonal matrix with 

(A)u,u = Ki{j) = ja-'K,{i) when d{0, u) = j. (4.10) 

Since M is positive semidefinite, also the diagonal matrix U*MU is positive semidefinite, 
which means that all diagonal elements Yll=o^i-^ii^)lJ^ ^ J — 0, . . . n are nonnegative. 



This implies (4.2). D 



In fact, the equivalence of U*MU >z and M ^ shows the following, which we 
mention for future reference. 

Proposition 23. For Xq, Xi, . . . , a;„ G M, we have 

XqAq + xciAi + ■ • • + XnAn ^0 if and only if (4.11) 

XoKj{{]) + XiKj{l) + ■■■ + XnKj{n) > for j = 0, . . . ,n. 
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4.2 Semidefinite programming bound 

In this section we describe a way to obtain upper bounds on Aq{n,d) by semidefinite 
programming. The method strengthens Delsarte's hnear programming bound and was 
introduced by Schrijver in [38] in the case of binary codes. There it was used to find a 
large number of improved bounds for binary codes. While this thesis was being written, 
the same method was used by de Klerk and Pasechnik in [2S] to bound the stability 
number of orthogonality graphs, (or equivalently) the maximum size of a binary code of 
length n in which no two words have Hamming distance |n, where n is divisible by four. 
In this section we will describe this method, but restrict ourselves to the nonbinary 



case. In Section |4.3| we give a list of improved upper bounds that we found with this 
method for g = 3, 4, 5. 

Let C be any code. We define the matrices M' and M" by: 

M' := |Aut(g,n)|-i J^ M,c (4.12) 

(TSAut{(;,n) 

oeo-c 



M" := |Aut(g,n)|'^ ^ M, 



c- 



a£Aut{q,n) 



By construction, the matrices M' and M" are invariant under permutations a G 
Auto{q,n) of the rows and columns. Hence M' and M" are elements of the algebra Aq^n- 
We write 

M'= Yl ^*;'<r (4-13) 

Here the matrices M^'J are the standard basis matrices of the algebra Ag^n- 
The matrix M" can be expressed in terms of the coefficients x^^ as follows. 



Proposition 24. The matrix M" satisfies 



Proof. The matrix 



M"= E (^lS-*-P,o - ^SX^- (4-14) 



M := M' + M" = \Ant{q,n)\-^ 2^ M^c (4.15) 

o-£Aut{q,n) 



is invariant under permutation of the rows and columns by any permutation a G Aut(g, n), 
and hence is an element of the Bose-Mesner algebra, say 

M = Y,ykAk. (4.16) 

k 

Observe that for any u G E with d{u, 0) = k, we have 

2/fc = (M)u,o = (M')u,o = 42, (4.17) 
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since (M")u,o = 0. Hence we have 

M" = M-M' (4.18) 

k {i,j,t,p) 

k i+j-'t—p=k (i,j,t,p) 

which proves the proposition. D 

The coefficients a;/^ carry important information about the code C, comparable to 
the distance distribution in Delsarte's hnear programming approach. Where the distance 
distribution records for each distance d the number of pairs in C at distance d, the 
coefficients x^'^ count the number of triples (u,v,w) G C^ for each equivalence class of 
E^ under the action of Ant{q,n). We express this formally as follows. Recall that 

Xij^t,p ■■= {(u,v,w) G E X E X E I rf(u,v,w) = {i,j,t,p)}, (4.19) 

for {i,j,t,p) G X{q,n). Now denote for each {i,j,t,p) G X{q,n) the numbers 

A*;^:=|(CxCxC)nX„-i,,|, (4.20) 

and let 

7*f :=l({0}xExE)nX,,,-i,p| (4.21) 

be the number of nonzero entries of Mj'j'. A simple calculation yields: 



n 



^ \p,t-p,t-t,j -t; 



The numbers x^j are related to the numbers X^j by 

Proposition 25. x-j = g""'(7i.f)~"^A-'^. 

Proof. Observe that the matrices M^^'j are pairwise orthogonal and that (Mj'J,Mj|') = 
7j'J for {i,j,t,p) G X(g,n). Hence 

{M',Ml'J) = \Ant{q,n)rJ2 E (M^^cM^'J) (4.23) 

«eC creAut((j,n) 

(TU=0 

= \Aut{q,n)\-' ■ |Auto(g,n)| ^-Kju} x C x C)nXij^t,p\ 

use 

= g-"|(C X c X C) n x.j^tj = g-"Ag 



implies that 



M' = q- Yl ^S(^5)"'Kr (4-24) 

(«,i,*,p)6X(g,n) 



Comparing the coefficients of the M^'f with those in (14. 131) proves the proposition. D 



«j 
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Proposition 26. The x^'^ satisfy the following linear constraints, where (Hi) holds if C 
has minimum distance at least d: 

(^) < x*'^^ < 4;° (4.25) 

(a) x^j = x-,j, ift-p = t'-p and 

{i, j,i+j — t— p) is a permutation of (z', j', i + j' — t' — p) 
(ttt) xg = z/{z,j, z+j-t-p}n{l,2,...,d-l}^0. 



Proof. Conditions (ii) and (iii) follow directly from Proposition 25 Condition (i) follows 
from the fact that if M = M^c for some a G Aut(g, n) with G aC , then < Mu_v < Mo,u 
for any u, v G E. D 

An important feature of the matrices M' and M" is, that they are positive semidefinite. 
This follows since M' and M" are nonnegative combinations of the matrices M„c = 
X"^{x"'~^Y which are clearly positive semidefinite. Using the block diagonalisation of 
Aq^ni the positive semidefiniteness of M' and M" is equivalent to: 

for all a, k with Q<a<k<n + a — k, the matrices (4.26) 

n+a—k 



^a{i,j,t,p,a,k)x-^j 
and 




n+a—k 



i,j=k 



are positive semidefinite. 

If we view the x^'^ as variables, we obtain an upper bound on the size of a code of 
minimum distance d as follows. 

Theorem 6. The semidefinite programming problem 

X;Q(g-l)^'° subject to (4.27) 



n 

maximize 

1=0 



Xg'o = 1, and conditions (4.25) and (4-26) 
is an upper bound on Ag{n,d). 



Proof. We first remark that conditions (4.25) and (4.26) are invariant under scaling the 
numbers x^'^ with a common positive factor. The constraint Xqq = 1 serves as a normali- 
sation. If C C E is a code of minimum distance d. Setting 

^S := ?" ■ A:;K| (4.28) 

gives a feasible solution with objective value \C\. D 
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This is a semidefinite programming problem with 0{n'^) variables, and can be solved 
in time polynomial in n. This semidefinite programming bound is at least as strong as the 
Delsarte bound. Indeed, the Delsarte bound is equal to the maximum of XlILo -^j o (T) (^ ^ 



0,0 



1)* subject to the conditions Xq'q = 1, x^', 
and 



0,0 




X 



„o,o 

d-1,0 



0,0 



0, XjQ > for all i = d, 



,n 



y Xj Q Ai is positive semidefinite. 



1=0 

as was shown in the previous section. This last constraint is equivalent to 

y ^i+j-t-p,o"^i,f is positive semidefinite. 



(4.29) 



(4.30) 



J2i+j-t-p=k ^ij- It follows that (|4.29|) is imphed by the condition that M' 



since Ak 

and M" be positive semidefinite, that is condition (4.26). 



4.2.1 Variations 



There are a number of obvious variations to the semidefinite program (4.27), altering the 



0,0 



objective function and the constraint Xqq = 1. For convenience we will optimize over 
matrices 

M--=Y1 ^*;'<:r (4.31) 

in the Terwilliger algebra. Observe that the numbers x^^ are uniquely determined by M 
and vice versa. The semidefinite program (4.27) can be rewritten as 



0,0 



maximize trM subject to (4.25), (4.26) and Xq'q = 1. 
Consider the following two variations 



maximize 



X 



„o,o 

0,0 



(4.32) 



(4.33) 



subject to (4.25), (4.26) and 



.0,0 
^0,0 



0,0 
^0,0 

trM 



^0, 



and 



maximize 1 Ml 



(4.34) 



subject to (4.25), (4.26) and trM = 1. 



The idea behind variation (4.33) is that for a code C, setting 



X. 



*'P := A*'^ ■ q"(t'^) 

1,7 4,7 y V It.l I 



(4.35) 



we obtain a feasible solution with Xqq = \C\ and trM = |Cp. If M' = J2i j^t^pVu ^ij 
is a feasible solution to (4.33), then M := (yQ'Q)~^M' is a feasible solution to (4.32) with 



im 



trM = trM' ■ (yo,'o) ^ — Z/o'o- Conversely, if M' = ^ijtpVijM^'J is a feasible solution to 



4.3. COMPUTATIONAL RESULTS 



45 



M.32), then setting x^^ :: 
Hence both semidefinite 






tiM' gives a feasible solution to (|4.33|) with Xq^q 
programs yield the same value. 



0,0 



The validity of variation (4.34) can be seen by setting x^j : 



A 



t,p 



tH:! 



t,p\-i 



l-trM'. 



CI -2 for 



a given code C. Then trM = 1 and 1^ Ml = \C\. For any feasible solution M ' to (|4.32 ), 
we have 1^ M'l > {tiM'f, hence M := (trM')-^M' is a feasible solution to (|4.34) with 



l^Ml > trM'. It follows that the optimum value in (4.34) is at least the optimum value 



in (4.32). We do not know if the reverse inequality holds. 



4.2.2 A strengthening 



It was observed by Laurent (see |28]) that not only is the matrix M" defined in (4.14) 
positive semidefinite, also the following stronger property holds: 






„o,o 

0,0 



(diag(M"))^ 



is positive semidefinite. 



^diag(M") M" 

This follows from the fact that for a code C and a G Aut(g, n) the matrix 



(4.36) 



1 ix^^V 



X 



aC 



X 



1 



is positive semidefinite 



(4.37) 



and the fact that semidefiniteness is preserved under taking nonnegative linear combina- 
tions. This yields the stronger semidefinite programming bound 



• • n 0,0 

maximize q ■ Xqq 



subject to (4.25), (4.26) and (4.36) 



where 



M" 



Ei-t 



+j-t-pfl 



- ^m. 



«j 



(4.38) 



(4.39) 



i,j,t,P 



Observe that condition (4.36) can be checked in time polynomial in n by Theorem 5 The 



bound obtained is as least as good as the one obtained from (4.32 ). Indeed, given a feasible 



solution to (4.38), the matrix M 
Hence 

g^trM' = l^Ml > trM^ 



M' + M" satisfies: R{M) is positive semidefinite. 



(?^ 






0,0\2 

o,oJ • 



(4.40) 



This implies that A^' := -uijM' is a feasible solution to (4.32) with trA^' > g" ■ Xq'q. Hence 



0,0 



the optimum in(4.38) is at most the optimum in (4.32). In the binary case, this yields an 
improved bound when n = 25 and d = Q. We did not find new improvements using this 
strengthening in the range q = 3,n < 16, q = 4,n < 12 or q = 5,n < 11. 



4.3 Computational results 

The semidefinite programming method was successfully applied to binary codes in [38] 
where a large number of upper bounds were improved. In this section we describe the 
computational results obtained in the nonbinary case. Apart from the binary case, tables 
of bounds on Aq{n, d) are maintained for g = 3, 4, 5. We have limited the computations to 
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these three cases and computed the semidefinite programming bound for the range n < 16, 
n < 12 and n < 11, respectively. The instances in which we found an improvement over 



the best upper bound that was known, are summarized in Tables 4.1, 4.2 and 4.3 below. 
As a reference we have used the tables given by Brouwer, Hamalainen, Ostergard and 
Sloane [H] and by Bogdanova, Brouwer, Kapralov and Ostergard [5] for the cases g = 3 
and g = 4, along with subsequent improvements recorded on the website of Brouwer [9] 
and the table by Bogdanova and Ostergard [6] for the case g = 5. 



Table 4.1: New upper bounds on A^{n, d) 







best 




best upper 








lower 


new 


bound 








bound 


upper 


previously 


Delsarte 


n 


d 


known 


bound 


known 


bound 


12 


4 


4374 


6839 


7029 


7029 


13 


4 


8019 


19270 


19682 


19683 


14 


4 


24057 


54774 


59046 


59049 


15 


4 


72171 


149585 


153527 


153527 


16 


4 


216513 


424001 


434815 


434815 


12 


5 


729 


1557 


1562 


1562 


13 


5 


2187 


4078 


4163 


4163 


14 


5 


6561 


10624 


10736 


10736 


15 


5 


6561 


29213 


29524 


29524 


13 


6 


729 


1449 


1562 


1562 


14 


6 


2187 


3660 


3885 


4163 


15 


6 


2187 


9904 


10736 


10736 


16 


6 


6561 


27356 


29524 


29524 


14 


7 


243 


805 


836 


836 


15 


7 


729 


2204 


2268 


2268 


16 


7 


729 


6235 


6643 


6643 


13 


8 


42 


95 


103 


103 


15 


8 


243 


685 


711 


712 


16 


8 


297 


1923 


2079 


2079 


14 


9 


31 


62 


66 


81 


15 


9 


81 


165 


166 


166 


16 


10 


54 


114 


117 


127 
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Table 4.2: New upper bounds on A4{n,d) 







best 




best upper 








lower 


new 


bound 








bound 


upper 


previously 


Delsarte 


n 


d 


known 


bound 


known 


bound 


7 


4 


128 


169 


179 


179 


8 


4 


320 


611 


614 


614 


9 


4 


1024 


2314 


2340 


2340 


10 


4 


4096 


8951 


9360 


9362 


10 


5 


1024 


2045 


2048 


2145 


10 


6 


256 


496 


512 


512 


11 


6 


1024 


1780 


2048 


2048 


12 


6 


4096 


5864 


6241 


6241 


12 


7 


256 


1167 


1280 


1280 



Table 4.3: New upper bounds on A^{n,d) 







best 




best upper 








lower 


new 


bound 








bound 


upper 


previously 


Delsarte 


n 


d 


known 


bound 


known 


bound 


7 


4 


250 


545 


554 


625 


7 


5 


53 


108 


125 


125 


8 


5 


160 


485 


554 


625 


9 


5 


625 


2152 


2291 


2291 


10 


5 


3125 


9559 


9672 


9672 


11 


5 


15625 


44379 


44642 


44642 


10 


6 


625 


1855 


1875 


1875 


11 


6 


3125 


8840 


9375 


9375 
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Chapter 5 
Covering codes 



Consider the following combinatorial problem. Given integers g, n and r, what is the 
smallest number of Hamming spheres of radius r that cover the Hamming space consisting 
of all g-ary words of length n? This covering problem is the dual of the packing problem 
from the previous chapter. Apart from being an aesthetically appealing combinatorial 
problem, it has several technical applications, for example to write-once memories and 
data compression. Another, down to earth, application is to betting systems. In many 
countries a popular game is played that involves forecasting the outcomes of a set of n 
(football)matches. Each match can end in three ways: a loss, a tie or a win for the hosting 
club. The goal is to find an efficient set of bets that is guaranteed to have a forecast with 
at most one wrong outcome. For this reason the covering problem in the case g = 3 and 
r = 1 is widely known as the football pool problem, see |20j . 

In this chapter we show how the method of matrix cuts from Chapter [6] can be applied 
to obtain new lower bounds on the minimum size of covering codes. For a survey of results 
on covering codes as well as many applications, the reader is referred to |14j . 

5.1 Definitions and notation 

Let q > 2 and n > 1 be integers. Let E := q" be the Hamming space consisting of all words 
of length n over the alphabet q := {0, 1, . . . , g — 1}. Recall that the Hamming distance 
d{u, v) of two words u, v G E is defined as the number of positions in which u and v 
differ. We define d{u, v) := {i,j, t) where i = d{u, 0), j = d(y, 0) and 2t = i + j — d{u, v). 
For a word u G E, we denote the support of u by 5'(u) := {i \ Uj ^ 0}. Note that 
[^(u)! = d{u,0), where is the all-zero word. Denote by 

Br{u) := {v gE I t/(u,v) < r} and (5.1) 

S,{u) := {veE\d{u,Y)=r} 

the ball and the sphere respectively, with center u G E and radius r. They are generally 
referred to as the Hamming sphere and the Hamming ring with center u and radius r in 
the literature. The covering radius of a code C (^ E is the smallest integer r for which 

y Br.{u) = E. (5.2) 

uec 

49 
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A code C C E is called an (n, K, q)r code if |C| = K and the covering radius of C is r. 
We denote 

Kq{n,r) := minjii' | there exists an {n,K,q)r code}. (5-3) 

In this chapter we will be interested in lower bounds on Kq{n,r). 

5.2 Method of linear inequalities 

An important tool used in deriving lower bounds on Kq{n,r) is the method of linear 
inequalities. Let C C E be a code and denote 

A{u) := \C n S,iu)\ (5.4) 

for u G E and i = 0, . . . ,n. We consider linear inequalities of a code. That is, valid 
inequalities of the form 

n 

Y^ XiAi{u) > P for all u G E, (5.5) 

where Aq, . . . , A„ > and /3 > 0. Such a set of inequalities is denoted by (Aq, . . . , A„)/3 
and leads to a lower bound on Kq{n, r) by the following proposition. 

Proposition 27. // any {n, K, q)r code satisfies (Aq, . . . , A„)/3 then 

K > ^n .(I, 7T7- (5.6) 

Ei=o^HJW-l) 



Proof. Summing |5.5| over all u G E we obtain 

n n 

1=0 ueE 

n 



ueE i=0 i=0 ueE 

n 



=0 vec 



ici|:.(';)(,-i, 



D 



The basic sphere covering inequalities 

r 

^ Ai(u) > 1 for all u G E (5.8) 

give the sphere covering hound 



5.2. METHOD OF LINEAR INEQUALITIES 
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Many other valid inequalities have been obtained, in particular in the binary case 
(g = 2), by studying the way the elements in Bs{u) can be covered for s = 1, 2, 3. In the 
case s = 1 this gives the van Wee inequalities 



r-l 

E 

i=0 



'n + 1 
r + 1 



Ai(u) + Ar(u) + Ar+l(u) > 



'n + 1 
r + 1 



(5.10) 



which improve upon the sphere covering bound whenever r + 1 does not divide n + 1. 
The case s = 2 leads to the pair covering inequalities found by Johnson [23] and Zhang 



r-2 



J]moA,(u) +mi(A,_i(u) + A,(u)) + A,+i(u) + A,+2(u) > mo, (5.11) 



j=0 



where 



mi 
mo 



maxj>2 



F{n-r+l,r+2)-F(n-iR+l,R+2) 



i-1 



(5.12) 



mi + F{n — r + 1, r + 2), 



and F{m, k) is the minimum number of k-sets needed to cover all pairs of an m-set. Other 
inequalities can be found in |7F] . 

Starting from a set of inequalities for a code, new inequalities can be obtained by 
taking nonnegative linear combinations. Also by summing the inequality (Aq, . . . , A„)/3 
over S'j(u), we obtain the induced inequality (Aq, . . . , A'„)/3', where 



Ai, 



P' ■■■- 



n 



a''- 



(g-W, 



and 



a. 



«j 



|{v|d(0,v)=z,rf(v,u)=j}| 



when (i(0, u) = k. The numbers af^ can be expressed as 

t+p=k+i-j 



a 



«j 



^..^.^.(.X":,') 



if g = 2. 



(5.13) 



(5.14) 



(5.15) 



Note that the bound obtained from an induced inequality is equal to the bound ob- 
tained from the original one. Using the fact that the Ai{u) are integers, the inequality 
(Ao, . . . , A„)/3 implies the inequality ( [Aq] , . . . , [A„] ) \f3] . This way the van Wee inequali- 
ties, for example, can be derived from the sphere covering inequalities as follows. Starting 
from the sphere covering inequalities, we obtain 



r-l 



^(n + l)Ai(u) + (r + l){Ar{u) + Ar+i{u)) >n + l for every u G E 



(5.16) 



j=0 



52 CHAPTER 5. COVERING CODES 

by summing the sphere covering inequahties over Bi{u). Then dividing by r + 1 and 
rounding up the coefficients, the van Wee inequahties are obtained. 

Using this method, Habsieger and Plagne obtained many new lower bounds in the 
binary and ternary case, by computer search see [T9] . 



5.3 Semidefinite programming bounds 

The bound from Proposition 27 may be viewed as a hnear programming bound as follows. 
Given A G M""^^ and /9 G M, define the polyhedron 

n 

P^^p ■= [x eMF- \Y^ \ix{S,{vL)) > P for aU u G E }. (5.17) 



4 = 



We have the following proposition. 
Proposition 28. 



Bq"' 
min{l^a; | x G Pa,/?} = ^n . /nw 7^1- (5-18) 

Proof. Observe that for any x G Pa,/3 also 

' V^' ^1 <jeAut(g,n) 

and X = cl where l^cl = l^x. Hence 

minjl^x I X G Pa,/?} = min{l"^cl | cl G Pa,/?} (5.20) 

n 

= min{g"c|^A,c|S,(0)|>/3} 

i=0 

= min{g"c | c > 



i=0 

/3 



Er=oA,.a)(g-i) 



EtoHi)i<i-ir 



D 



Clearly, replacing Pa,/3 by PA,/3n{0, l}'^ and considering the 0-1 optimization problem, 
can be expected to give a better lower bound. In fact, when (Aq, . . . , A„)/3 corresponds to 
the sphere covering inequalities, this 0-1 program gives the exact value i^q(ra,r)F| This 
motivates to replace the linear relaxation Pa^/j by a tighter (semidefinite) relaxation using 
the method of matrix cuts from Chapter 16} We will pursue this idea in the following. 



^In general there may be solutions that do not have covering radius < r, for example when n = 3, r = 1, 
the code {100,010,001} has covering radius 2 but satisfies the van Wee inequalities. 
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5.3.1 The first SDP bound 

In this section we derive a semidefinite programming lower bound on Kq{n,r) with 0{n) 
variables and 0{n) constraints. This bound is equal to the value obtained by minimizing 
l^x over A^+(Px,^), see Chapter pi 

To any code C C E, we associate the symmetric 0-1 matrix Mc defined by: 

I 1 if u, V G C, , , 

{McU^:=\ . . (5.21) 

I U otherwise. 

Let C C £■ be a code. Define the matrix 

M:=\Ant{q,n)\-' J^ M^c- (5.22) 

cr6Aut(g,n) 

By construction, the matrix M is invariant under permutations of the rows and columns 
by any a G Aut(g, n). Hence M is an element of the Bose-Mesner algebra of the Hamming 
scheme and we write 

n 

M = J]x,Ai, (5.23) 

where Ai is the z-th basis matrix of the Bose-Mesner algebra and xq, . . . , x„ G M. 

Proposition 29. The matrix M satisfies the following. 

(i) trM = \C\, (5.24) 

(ii) M >0 and R{M) h 0, 
(iii) // C satisfies (Aq, . . . , A„)/3, then 

Mu G Mu,uPx,p and diag(M) - M„ G (1 - M„,u)Pa,/3 

for every u G E. 

Proof. Since M is a convex combination of the M^-c, a G Aut(g, n), it suffices to observe 
that the contraints hold for each M„c- Clearly, trMo-c* = \C\ and M^^c > 0. As R{M(^c) = 
( Ic) ( Ic) ) R{Mfjc) is positive semidefinite. Finally, for any u G E 

(M.c)u = (M^cUuX''^ (5.25) 

and 

diag(M.c) - {M,c)u = (1 - iM^c)u,u)x'''' (5.26) 

and hence (iii) follows from the fact that aC satisfies (Aq, . . . , Xn)P for every a G Aut(g, n). 

D 

Below, we will make these constraints more explicit by expressing them in terms of 
the variables Xi. 
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Proposition 30. R{M) y is equivalent to 

n 

y XiPj{i) > for every j = 0, . . . , n (5-27) 

j=0 

and 



Proof. Since trM = g"xo and l^Ml = q"'Yl^=o^i{T)^1 ~ ■'-)*' ^^ follows from Proposition 
gthat R{M) ^ if and only if M ^ and 

q"' q"'Xo 



is positive semi definite. By Proposition 23 it follows that M = Y17=o-'^i^i ^^ positive 



semidefinite if and only if ^1^=0 ^i-Pji"^) — f*^^ every j = 0, . . . , n. D 

Proposition 31. Let x = Yl^=oXiX^'^^^ ^ ^^- Then the following are equivalent: 

n 

(i) Yl Xix{Si{u)) > (5 for every u G E, (5.28) 

j=0 
n n 

(ii) y^ Xj ■ y^ Koiij > /3 for every /c = 0, . . . , n. 

jr=0 i=0 

Proof. If d{u, 0) = k then 



n n 



1=0 



2.^ 

i=0 


2^ 2^ ^^- 

j=o veE 

d(0,v)=i 
d(u,v)=i 


n 

>> 

i=0 


n 

i=o 


n 


n 



(5.29) 



D 
Proposition 32. The following are equivalent 

(i) Mu G M^^^Px,p and (5.30) 

diag(M) - Mu G (1 - Mu,,)Pa,/3 
for every u G E, 



n n 



(ii) 5Za;,-^A,af,, >a:o/9 (5.31) 

j=0 j=0 

n n 

^(xo - Xj) ■ Y^ Xialj > (1 - xo)/3 

j=0 j=0 

/or every fc = 0, . . . , tt,. 
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Proof. Directly from Proposition 31 D 

Collecting all the propositions, we obtain the following theorem. 

Theorem 7. If every code C C E with covering radius r satisfies (Aq, . . . , A„)/3, we have 

Kq{n,r) > mm q'^Xo, (5.32) 

X 

where the minimum ranges over all x = (xo,Xi, . . . ,Xn)'^ G ]R"+^ satisfying 

(i) Xk > 0, (5.33) 

n 

(ii) ^XiPfc(^)>0, 

i=0 
n n 

(iii) X] ^' ■ X] ^i'^ti - ^^0' 

n n 

(iv) ^(xo - a;*) ■ ^ Ajttjj- > /3(1 - xq), 

i=0 i=0 

g" g"xo , ^ Q 



/or all k = 0, . . . ,n. 

Proof. D 

Observe that if we relax the semidefinite program by only requiring M to be positive 
semi definite instead of R{M) (that is: delete condition (v)), we obtain for a linear program 
in 0{n) variables and inequalities that is a lower bound on Kq{n,r). 

5.3.2 The second SDP bound 

In this section we describe a stronger semidefinite programming relaxation that uses more 
of the symmetry of the Hamming space, but requires 0{n'^) variables in the binary case 
and 0{n'^) variables in the nonbinary case. In this section we will focus on the binary case. 
The nonbinary case is very similar, although more complicated and it will be adressed in 
the next section. 

Restricting ourselves to the binary case, we have E = {0, 1}", the n-dimensional 
Hamming cube. Let C C E be any code and define the matrices M' and M" by: 

M' := |Aut(2,n)|-^ ^ M^c (5.34) 

o-eAut(2,n) 

oeo-c 



M" := |Aut(2,n)|-^ ^ M^c- 



o-eAut{2,n) 
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By construction, the matrices M' and M" are invariant under permutations a G Auto (2, n) 
of the rows and columns, that fix the element 0. Hence M' and M" are elements of the 
algebra ^2,n- Write 

M'=J2 <jMlv (5-35) 

ii,j,t) 

'^* ; are the zero-( 
expressed in terms of the coefficients xj • as follows. 

Proposition 33. The matrix M" satisfies 



where the matrices M* • are the zero-one basis matrices of ^2,n- The matrix M" can be 



Proof. The matrix 



M" = E (^°i-2*,o - <j)Mlr (5-36) 



M:=M' + M"= I Aut(2,n)r^ J2 ^oc (5.37) 

(TeAut(2,n) 



is invariant under permutation of the rows and columns by any permutation a G Aut(2, n), 
and hence is an element of the Bose-Mesner algebra, say 

M = Y,ykAk. (5.38) 

k 

Observe that for any u G E with (i(u, 0) = /c, we have 

Vk = (M),,o = (M')u,o = x%, (5.39) 

since (M")uo = 0- Hence we have 

M" = M-M' (5.40) 

k i+j-2t=k {i,j,t) 

which proves the proposition. D 

Proposition 34. The matrices 

are positive semidefinite. 
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Proof. Clearly, R{M(jc) = ( <tc) ( <tc) is positive semidefinite for each a G Aut(2,?2). 
Hence R{{xqq)~^M') and -R((l — x^q)~^M") are positive semidefinite as they are convex 
combinations of the R{M„c)- This implies the statement in the proposition. D 



Using the block diagonalisation of A2,n, Proposition 34 is equivalent to the following 
matrices being positive semidefinite 



n—k / „ \ n—k 



7 ^ Pi,i,k^i,i I ' I 7 ^ Pi,i.k\^i+^-'^t " 3^,' 



i+j-2t,0 -^iJJ 
, i=0 / ij=k \ *=0 / i,j=k 



for each fc = 1, . . . , [|J , 



.*=0 / i,j,=0 



where 



7_j Pi,7',0^'^''-'--i'— ■>+ n 3^^ 



.*=0 / i,j=0 



Xi := {xIq - x°o) ( ^ ) , for i = 0, . . . , n. 



Proposition 35. The coefficients x* • satisfy the following: 

2X0 = 1^1, (5.42) 

and for any i,j,t 

(i) < xl < xl, (5.43) 



n 



' j' ^/ (^) J) i + 3 ~ 2t) zs a permutation 



(_mj Xj ,,• = Xj, 

Proof. Since for any u G E 

\{a G Aut(2,n) | au = 0}| = |Auto(2,n)|, (5.44) 

we obtain 

_ |{aGAut(2,n)|0GaC}| _ ,^, |Auto(2,n)| _ ,_„|^| ,_ ._. 

"^O'"" |Aut(2,n)| "'^' |Aut(2,n)| "^ '^'- ^^'^^^ 

Inequalities (i) and (ii) follow from the fact that (M')u,u > (M')u,v and (M")u,u > (Af ")u,v 
for any u, v G E respectively. The truth of (iii) can be seen as follows. Let u, v G E be 
such that (i(u, v) = {i,j,t) and let {i',j',t') be such that {i,j,i+j — 2t) is a permutation 
of {i',j', i' + j' — 2t'). It can be seen that in that case there is a a G Aut(2, n) such that 
a{0,u,v} = {0,u',v'} with rf(u',v') = {i',j',t'). Hence 

x7 = (M')u,v = (M')u',v' = 4,j'- (5.46) 

n 



58 CHAPTER 5. COVERING CODES 

Given two words u, v G E with (i(u, v) = {i,j,t), we denote by «(*']) |n ^^ the number 

of words w G E with (i(u, w) = {i,j',t') and d{\,w) = d. This number is well-defined, 
and indeed we have the following proposition. 

Proposition 36. The numbers a,.''', ,,■. , are given by 

aoo,aoi,aio,aii ^/^ / \ / \ / 

where the indices aoo,aoi,aio and an range over the nonnegative integers that satisfy 

j' = aoo + ooi + oio + On (5.48) 

t = aio + Oil 
d — j = aoo + ^10 — oqi ~ ctii- 
Proof. Partition the support of the words w into four sets Aqq, Aqi, Aiq and An as follows: 



^1 
Aw 
An 



Aoo ■■= {k G 5(w) I Ufe = 0, V, = 0} (5.49) 

: {keSiw) |ufe = 0,Vfe^O} 

: {keSiw) |ufe^O,v, = 0} 

: {keSiw) |ufc^O,v, ^0}. 

If we denote the sizes of these four sets by aoo, oqi, aio and an respectively, we obtain the 
claimed result by summing over all possible sets Aoo,Aoi,Aio and An- □ 

Proposition 37. Let u G E &e a word with d{u, 0) = i and let x G M"^ be such that x^ 
only depends on (i(u,v), say x^ = x* •, when d{u,\) = {i,j,t). Then 

n 

J2 Ada;(5d(v)) > (3 for all^reE (5.50) 

is equivalent to 

n 

^ xiy ■ Y, ^d4:iw >p f^^ «" •^' ^- (5-51) 

j',t' d=0 

Proof. Let v G E and let d{u,\) = {i,j,t). Then we have the following equalities. 

n n 

^A,x(5,(v)) = 5^A, Yl ^w (5.52) 

d=0 d=0 weE 

d(v ,'w)=d 



E^^E E 

d=o j',t' weE 

d(v,w)=d) 
d(u,w)=(jj',t') 



•Jy\ 



E^^E 



(i,j,t) t' 

d=0 j',t' 

n 

f,t' d=0 
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D 

Proposition 38. // the code C satisfies the set of inequalities (Aq, . . . , A„)/3, then the 
variables xjj satisfy the following set of inequalities. For every tuple {i,j,t) 

Y.4r->^ri ^ <o/3 (5.53) 

_^(^i',0 ~ ^i,j') ' ^j',t' — (^0,0 ~ ^i,o)P 
i+j'-2t'fl + ^1,3') ' -^f't' — \^ ~ ^%,0 + ^i,o)P^ 

where we use the shorthand notation 



j'.t' 



j',t' 

_ „0 _ 
-^j'fi -^^ 



^K^=E^ 






d=0 



Proof. For any a G Aut(2,n), the matrix M := Mcrc satisfies 

diag(M)-M„ e il-M^^^)Px,i3 
for every u G E. 



(5.54) 



(5.55) 



Tliis implies tliat also the matrices -g— M' and t^^-q-M" satisfy (5.55) as they are convex 



combinations of the matrices M^^c- Now using Proposition 37 gives a proof of the claim. 

n 

This leads to the following semidefinite programming bound on K2{n,r). 
Theorem 8. // any code C C E with covering radius r satisfies (Aq, . . . , A„)/3, we have 



K2{n,r) >min2X 



0' 



(5.56) 



where the minimum ranges over all x = (x* •) satisfying (5.42), (5.45) and (5.53). 
Proof. n 

5.4 Nonbinary case 

In this section we consider the nonbinary case, that is g > 3. The nonbinary case is very 
similar to the binary case described in the previous section and we will skip some of the 
details in the proofs. 
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Again define the matrices M' and M" by 

M' := |Aut(2,n)|-^ ^ M,c (5.57) 

o-eAut(2,n) 

oeo-c 
M" := |Aut(2,n)|-i ^ M.c- 

o-eAut{2,n) 

The matrices M' and M" are invariant under permutations of the rows and columns by 
permutations a G Auto(5', n). Hence M' and M" are elements of the algebra Aq^n- We 
write 

(«,i,t,p) 
where the M-'f are the 0-1 basis matrices of the algebra Ag^n- The matrix M" can be 



^,J 



expressed in terms of the coefficients Xj'^ as follows. 



*j 



Proposition 39. The matrix M" is given by 



Proof. The matrix 



M"= Y.^^-t-vfl-^^'^- (5-59) 

(i.i.i.p) 



M:=M' + M" = I Aut(g,n)|-^ ^ M,c (5.60) 

o-GAut(g,n) 



is invariant under permutation of the rows and columns by permutations a G Aut(g, n), 
and hence is an element of the Bose-Mesner algebra, say 

M = Y,ykAu. (5.61) 

k 

Note that for any u G E with |iS'(u)| = /c, we have 

Vk = (M),,o = (M')u,o = 4:11> (5-62) 

since (M")u,o = 0. Hence we have 

M" = M-M' (5.63) 



r*.p 



A; i+j—t—p=k 
{i,j,t,p) 

which proves the proposition. D 
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Proposition 40. The matrices 

^ "^^ Uiag(M") M" ) ^^-^^^ 

are positive semidefinite. 

Using the block diagonalisation of Aq^n, the positive semidefiniteness of R and R' is 
equivalent to: 

for all a, k with 0<a<k<n + a — k, k^O the matrices (5.65) 

\ n+a~k 

^a{i,j,t,p,a,k)xl'^A 
*'P / i,j=k 

and 

\ n+a—k 

^ a(i, J, t, p, a, k) (x°;°,.„t_p g - x*'p J 

are positive semidefinite, and 

^a{i,j,t,p,0,0)xl'^A 

*'P / i,j=0 

and 

1 0,0 T 

X L 

are positive semidefinite, where 



L := I ^a(^,j,t,p,0,0)(x• 



n 



0,0 ^t,p 

i,j=0 



+j-t~p,0 ^i,jJ f 5 



^^ ■■= K;o - ^m) ( J (g - 1)' for ^ = 0, . . . , n. (5.66) 



Proposition 41. T/ie coefficients x/^ satisfy the following. 

q-xl, = \C\, (5.67) 

anrf /or any i, j, t, p 

(t) < xg < x:;:, (5.68) 

/••) 0,0 I 0,0 _ 0,0^ t,p ^ 0,0 

(Hi) Xj'^ = x^,'"^, z/ (i, j, i + j — t — p) is a permutation of 
{i\i\ i' + j' -t' - p') and t - p=t' - p' . 
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Proposition 42. Let u,v G E be words with d{u,\) = {i,j,t,p) and let {i,j',t',p') and 
d be given. Then the number Oiu^Pp/) ^ of words w G E with d{u,w) = {i,j',t,'p') and 
d{\, w) = d is given by 



a 



{ij',t',p'),d 



E 

ai,a2 

bi,b2 

C1,C2 

di,d2,d3 

e 



i-t 

0,1, 02 



J-t 



P \j t-P 
Ci,C2/ \di,d2, ds 



n + t — i 
e 



{q - lY{q - 2y'+^'+''{q - S^'' , (5.69) 



where the indices ai, 02, 6i, 62, Ci, C2, (ii, (i2, c^s and e range over the nonnegative integers 
that satisfy 



(5.70) 



/ = oi + 02 + bi + b2 + ci + C2 + di + d2 + ds + e 

t' = oi + 02 + ci + C2 + di + d2 + ds 

p' = ai + ci + di 

d = Oi + 02 + e + j — bi — ci ~ d2. 

Note that in the case g = 3 we adopt the convention that 0° = 1. 

Proof. Partition the support of the word w into sets Ai, A2, -Bi, B2, Ci, C2, -Di, -D2, -D3 
and E as follows 

Ufc 7^ 0, Vfe = 0, Wfe = Ufe} (5.71) 

Ufc 7^ 0,Vfc = 0,Wfe 7^ Uk} 

Ufc = 0,Vfc 7^ 0,Wfe = Vfc} 

Ufc = 0,Vfc 7^0,Wfc 7^ Vfe} 

Ufc 7^ 0,Vfe = Ufc,Wfc = Ufc} 

Ufc 7^ 0,Vfc = Ufc,Wfc 7^ Ufc} 

Ufc 7^ 0, Vfc 7^ 0, Ufc, Wk = Uk} 

Ufc 7^ 0, Vfc 7^ 0, Ufc, Wfc = Vfc} 

Ufc 7^ 0, Vfe 7^ 0, Ufc, Wfc 7^ Ufc, Vfc} 

Ufc = 0,Vfc = 0}. 

If we denote the sizes by ai, a2, 61,62, ci, C2, rfi, ^2, ^3 and e respectively, we obtain the 
proposition by summing over all possible sets Ai, A2, -Bi, -B2, C*!, C2, -Di, -D2, -D3 and 



Al 


= {ke S{w) 


A2 


= {keSiw) 


Bi 


= {ke S{w) 


B2 


= {ke S{w) 


Cr 


= {keSiw) 


C2 


= {ke S{w) 


Di 


= {ke S{w) 


D2 


= {keSiw) 


Ds 


= {ke S{w) 


E 


= {ke S{w) 



Proposition 43. Let u G E 6e a word with |5'(u)| 
only depends on d{u,\), say x^ = x^'^, when (i(u, v) 



i and let x G M.^ be such that x^ 
{hj,t,p). Then 



J2 Ada;(5d(v)) > f3 for allweE 



(5.72) 



d=0 
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is equivalent to 

n 

Yl ^if ■ Yl ^dO^i'^l'U > /5 for every j, t, p. 

j',t',p' d=0 

Proof. Let v G E and let d{u,\) = {i,j,t,p). Then we have the following equality. 



(5.73) 



J2^MS,M) = E E 






d=0 



d=0 wGE 
d{v ,w)=d 

n 
d=0 j',t',p' 



iJ' ^{i,j',t',p'),d 






),d- 



j',t',p' 



d=0 



(5.74) 



(5.75) 



(5.76) 



D 



Proposition 44. If the code C satisfies the set of inequalities (A,/3), then the variables 
Xj'^ satisfy the following set of inequalities. For every tuple {i,j,t,p) 



j',t',p' 

f,t',p' 

j',t',p' 

E/ 0,0 _ 0,0 _ 0,0 I t',p'\ , 

1^0,0 ^j'fl ^i+j'-t'-p'fi + ^ij' ) ■ ^ 

j',t',p' 
where we have used the shorthand notation 



i,j',t',p') — •^iflf-' 

i,j,t,p) > |'^0.0_ 0,0x^ 

i,j',t',p') — '^■^0,0 ■^ifl)'-' 



hht,p) > |'^0,0_ 0,0x/o 

i,j',t',p') — V-^0,0 ■^ifijl-' 



i,j',t',p') — \^ '^■^0,0^ -^1,0 J I-' 



(5.77) 



^{i,j',t',p') ■~ / . '^d^a.i'.t'.v' 



),d- 



(5.78) 



d=0 



Proof. For any a G Aut(g, n), the matrix M := M^rc satisfies 

Mu G Mu,uP\,i3 and diag(M) - M^ G (1 - Mu,u)P\,/^ for every u G E. (5.79) 



This implies that also the matrices o,d.„ and — ^ satisfy (5.79) as they are convex 



combinations of the matrices M^c- Now using Proposition [43] gives a proof of the claim. 

D 

Theorem 9. // any code C C E with covering radius r satisfies (Aq, . . . , A„,)/3, we have 

Kg{n,r)>mmq''xl'o, (5.80) 



where the minimum ranges over all x = (x/p satisfying (5.65), (5.68) and (5.77). 
Proof. 



D 
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5.5 Computational results 



Using the sphere covering inequahties, we obtained a number of exphcit new upper bounds 
in the case g = 4 and g = 5. The resultqj are shown in table 5A and 5^ below. The 
upper bounds and previous lower bounds are taken from the website of G. Keri ( [24] ) . 
who maintains an updated table of upper and lower bounds on covering codes. In the 
binary and ternary case, no new lower bounds were found. 



Table 5.1: New lower bounds on K^ln, R) 







best 




best lower 








upper 


new 


bound 


Sphere 






bound 


lower 


previously 


covering 


n 


R 


known 


bound 


known 


bound 


7 


1 


1008 


762 


752 


745 


11 


1 


131072 


123846 


123362 


123362 


9 


2 


1024 


748 


747 


745 


10 


2 


4096 


2412 


2408 


2405 


11 


2 


16128 


7942 


7929 


7929 


11 


3 


2048 


843 


842 


842 


9 


4 


64 


22 


21 


21 


11 


4 


512 


134 


133 


133 


11 


5 


128 


31 


30 


30 


11 


6 


32 


10 


9 


9 



^In the instance i? = l,n = llwe were unable to solve the second SDP. The given number is the 
bound obtained from the first SDP. 
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Table 5.2: New lower bounds on K^{n, R) 







best 




best lower 








upper 


new 


bound 


Sphere 






bound 


lower 


previously 


covering 


n 


R 


known 


bound 


known 


bound 


^7 

1 


1 


3125 


2722 


2702 


2694 


8 


1 


15625 


11945 


11887 


11838 


9 


1 


78125 


53138 


52800 


52788 


10 


1 


390625 


238993 


238200 


238186 




2 


115000 


52842 


52788 


52788 




3 


21875 


4253 


4252 


4252 




4 


3125 


510 


509 


509 




5 


625 


87 


86 


86 




6 


125 


21 


20 


20 
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Chapter 6 
Matrix cuts 



In Chapter |4] we discussed the problem of finding good upper bounds on the maximum 
size of a code with certain distance constraints. This is a special case of the general 
problem to find bounds for the stability number of a graph. There exist general methods 
for bounding the stability number. In this chapter we explore the relationship between 
these general methods, when applied to codes, and the method from Chapter |4j 
Recall that for any symmetric matrix A G M"^"', the matrix R{A) is defined by: 

n(A) := (^ «;) , (6.1) 

where a := diag(A) is the vector of diagonal elements of A. We will index the extra row 
and column of R{A) by 0. Denote the convex set of symmetric matrices 

Un := {A e M"^" I R{A) >z 0}. (6.2) 

Observe that for A G 7^„, the entries of A belong to [—1, 1]. Indeed, let i,j G {0, . . . , n}. 
The principal submatrix 

of R{^Al) indexed by and i is positive semidefinite. This is equivalent to A\^ < 1 ■ Ai^i, 
which implies that Ai^i G [0, 1]. For i < j the semidefiniteness of the principal submatrix 
of R{A) indexed by i and j 

f' fA (6.4) 

implies that A'^j < Ai^iAjj < 1 and hence Aij G [—1, 1]. We define the projection p{Ai) 
of aset MC 7^„ and the' lift 1{K) of a set iT C [0, 1]" by 

p(M) := {diag(A) \ A e M} (6.5) 

1{K) := {A G 7^„ I diag(A) G K}. 

By the previous remarks we see that p{Ai) C [0, 1]" for Ai C 7?.„. Observe that for any 
K C [0, 1]" we have p{l{K)) = K. Indeed, if x G [0, 1]", the matrix 

j( j +Diag(0,xi -Xi,...,x„-x^) (6.6) 

67 
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is a positive semidefinite matrix of the form R{A) with diagonal x. Conversely, we only 
have l{p{M)) D A^ for A^ C 7^„. 

In the following, the idea will be for a given convex set K, to find approximations of the 
convex hull of the 0-1 points in K. The method will be to describe these approximations 
as the projection of set in the larger space 7^„. The most prominent example is the 
so-called theta body of a graph, and the associated Lovdsz theta number. 

6.1 The theta body TH(G) 

Let G = (V, E) be a graph. We will assume that the vertex set is given by \^ = {1, . . . , n}. 
Define the set M (G) by 

M{G) := {A G 7^„ I Aij = if {i,j} G E}. (6.7) 

The projection 

TH(G) := p{M{G)) = {diag(A) | A G M{G)} (6.8) 

was defined in [TTj and is referred to as the theta body of G. The number 

79(G) := max{l^x | x G TH(G)} (6.9) 

was introduced by Lovasz in [31] as an upper bound on the Shannon capacity of the 
graph G. Although we will not be concerned with Shannon capacities, the following two 
properties of '(9(G) are relevant to our discussion: the number t9(G) can be approximated 
in polynomial time, and gives an (often close) upper bound on the stability number a{G). 
This last fact follows since for every stable set S" C V in the graph G, the matrix x^ix^V 
belongs to Ai{G). The theta body gives a good approximation of the stable set polytope. 
In particular, for perfect graphs G, equality holds, implying that the stability number can 
be calculated in polynomial time for perfect graphs. 

The following strengthening of the theta body was given by Schrijver in [36]. Define 

M'{G) ■.= {AeTZn\A> 0,Aij = if {z,j} G E}, (6.10) 

and 

TH'(G) :=p(Al'(G)). (6.11) 

Again the number 

-d'iG) := maxjl^a; | x G TH'(G)} (6.12) 

gives an upper bound on a{G) and clearly {}'{G) < i!}{G). We note that {}'{G) can be 
alternatively defined by 

^'(G) = max{ l^Al \ A G M^^",trA = 1, (6.13) 

Aij = when {i,j} G E}, 

and similarly for ^(G). The equivalence of the two definitions follows from Propositions 
g and |8| in Chapter |2| (see also [37] ) . 
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It was shown in [36] that for association schemes, the number '(9'(G) corresponds to the 
Delsarte bound. Given a scheme {X, R) with adjacency matrices I = Aq, Ai, . . . , A^ and 
M C {1, . . . ,n} we are interested in the maximum size of an M-chque, that is a subset 
S (^ X with the property that {Ai)x,y = for all x,y E X and i ^ M. Consider the graph 
G = {X, E), where E = {{x, y} \ {Ai)x,y = 1 for some i ^ M}. Then the stable sets of G 



are precisely the M-chques of the scheme {X,R). By (6.13), the upper bound t9'(G) on 
the maximum size of a stable set in G is given by 

maxjl^Al I A G M>(f^,trA = l,Aij = when {z,j} G E}. (6.14) 

We will sketch a proof that this maximum equals the Delsarte bound. The proof consists 
of two ideas. 

Proof. First, we may restrict the range of A in the program to the matrices in the Bose- 
Mesner algebra, without decreasing the maximum. Indeed, let vr denote the orthogonal 
projection onto the Bose-Mesner algebra (as a subspace of M"^^^) given by 

where the matrices Eq, . . . ,En are the orthogonal idempotents of the scheme. Since the Ei 
have eigenvalues and 1, they are positive semidefinite. Hence for positive semidefinite 
A the projection tcIA) is a nonnegative combination of positive semidefinite matrices, 
and hence again positive semidefinite. Furthermore, vr preserves the inner product with 
matrices in the Bose-Mesner algebra. In particular 

tT7T{A) = {I,7r{A)) = {I,A)=tTA (6.16) 

l'^7c{A)l = {J,7r{A)) = {J,A) = 1^A1 

{Ai, n{A)) = {Ai, A) = for z ^ M 

{Ai,'K{A)) = {Ai,A)>0 for i = 0,...,n. 

It follows that tt{A) is a feasible point with the same objective value as A. 
Secondly, writing 

n 

A = J2xiA, (6.17) 

i=0 

where Ai := {Ai,Ai)~ Ai, the program becomes 

maxj^Xj I xo = l,Xi > for 2 G M,^XiAi >z 0}. (6.18) 

Since Ai = Yl^=o Qj,i {^ji ^j)~ ^ji where Q is the second eigenmatrix of the scheme, the 
positive semidefinite constraint reduces to linear constraints 

^ x.Qj^i > for j = 0, . . . , n. (6.19) 

n 
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We remark that when the Bose-Mesner algebra is the centrahzer algebra of its au- 
tomorphism group, for example in the case of the Hamming schemes and the Johnson 
schemes, the orthogonal projection vr satisfies 



7r(A) = |r|-i^aA, (6.20) 

crer 

where F denotes the automorphism group of the scheme. 

6.2 Matrix cuts 

In [32], Lovasz and Schrijver introduced a general lift and project method for strengthening 
approximations of 0-1 polytopes. Given a convex body K contained in the unit cube 
[0, 1]", a convex body N^{K) is constructed such that 

K D N+{K) D N+{N+{K)) D---D N^+\k) = Kn{0, 1}". (6.21) 

An important property of the operator A^+ is that for a family /C of convex bodies, if 
one can optimize in polynomial time over K for each K & K,, then also the optimization 
problem over N^{K) is polynomial time solvable for K & IC. An important instance 
is when G = (V, E) is a perfect graph and K = FRAC((j') is the fractional stable set 
polytope of G. In that case one iteration of the A^+ operator suffices to obtain the stable 
set polytope STAB(G') := FRAC(G) n {0, 1}^. 

We start by describing the lift-and-project-method of Lovasz and Schrijver and prove 
some of the basic properties of the operator N^. The idea is to lift a convex set K C [0, 1]*^ 
to a convex set in the space of symmetric positive semidefinite n x n matrices and then 
to project it back into [0, 1]". 

For MCTZn, define the set N{M)hY 

N{M) := {A G 7^„ I for i = 1, . . . , n there are U,V e M (6.22) 

such that Ai = Ai^i ■ diag([/), 
diag{A)~A = {l-A^,)dmg{V)}. 

The operator A^^_ is now defined as 

N+iK):=piNiliK))). (6.23) 

Clearly 

A^+(ir)C[0,l]^ (6.24) 

since ii R{A) is positive semidefinite diag(y4) G [0, 1]" as we have seen before. Furthermore, 
we have: 

N+{K) C K. (6.25) 

Indeed, if A G N{l{K)), then for any i = 1, . . . ,n we have: 

diag(A) = A + (diag(A) - Ai) G Ai^i ■ K + (1 - A,i) ■ K (6.26) 
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and hence diag(A) G K, since Ai^i G [0, 1]. Note that this argument shows that in fact 

N+{K) C conv.hulljx e K \ x,, E {0, 1}} (6.27) 

for i = 1, . . . n since for each i we have 

Ai G Ai^i ■{xeK\xi = l}, diag(A) - A^ G (1 - A^i) ■{xeK\xi = 0}. (6.28) 

By induction it then follows that 

Nl{K)C{0,l}''nK. (6.29) 

On the other hand, A^_|_ does not cut off any integer points: 

{0,lVr]K CN+{K), (6.30) 

since for any x G {0, 1}"" fl K the matrix xx"*" belongs to N{l{K)). The operator A^+ was 
introduced in [32], see also [37] . 

Let G = {V, E) be a graph and let FRAC(G') denote the fractional stable set polytope 
of G, that is: 

FRAC(G) := {x G M^ I X > 0,Xi + Xj < 1 for any edge {i,j} G E}. (6.31) 

We observe that A^+(FRAC(G')) is contained in the modified theta body TH'(G). Indeed, 
if A G A^(/(FRAC(G))), then Aij = for any edge {i,j} since Ai G Ai^i-FRAC{G) implies 
that 

A,i + A,j < Ai^i ■ 1. (6.32) 

We will also consider the operator A^ given by 

N{M) := {AeUnl ioTi = l,...,n there are (6.33) 

UeA,M,Ve{l-A^,)M 

such that Ui^i = Ai^i and A = U + V}. 

Clearly N{M) C N{M) for any M C 7^„. We show that any x G p{M) H {0, 1}" belongs 
to p{N{Ai)). Let A E Ai have diagonal x G {0, 1}". Then for i = 1, ... ,n we have 
A = Ai^iU + (1 - Au)V, where we take f/ = A, 1/ = if Ai,i = 1 and [/ = 0, 1/ = A if 
Ai^i = 0. The set A^(A^) can alternatively be described by: 

N{M) = {A\Ae conv.hulljM G M \ Mi^ G {0, 1}} for each z = 1, . . . , n}. (6.34) 

Proof. Let A G A^(A^). Let i G {1, . . . ,n} and let f/, V be as in the definition. Observe 
that R{A) is positive semidefinite, since A = U + V e Ai^iM + (1 - Ai^i)M = A^. We 
prove that 

Ai = di&g{U) and diag(A) - Ai = diag(V"). (6.35) 



If Aj j = we have Ai = since A is positive semidefinite, and (6.35) follows. Hence we 
may assume that Ai^i > 0. Notice that V^.^j = and hence Vi = 0. Since A^lUi^i = 1 it 
follows from the positive semidefiniteness of R{A~lU) that f/j = diag(f/). Hence A^ = 
Ui + Vi = diag([/) and diag(A) - Ai = diag(f/) Ui + diag(r) + Vi = diag(r). D 
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6.3 Bounds for codes using matrix cuts 

Fix integers 1 < li < n and q > 2, and fix an alphabet q = {0, 1, . . . , g — 1}. The Hamming 
distance d{x, y) of two words x and y is defined as the number of positions in which x and 
y differ. Let G = {V,E) be the graph with V = q", where two different words x,y &V 
are joined by an edge if x and y differ in at most d — 1 position. The stable sets in G 
are precisely the g-ary codes of length n and minimum distance at most d. The stability 
number of G equals Aq{n, d). Define 

M' ■.= {Aenv\A> 0, A^^y = if {x, y} e E}, (6.36) 

and let 

TH'(G) := p{M') = {diag(A) | A G M'} (6.37) 

denote the modified theta body of G. Maximizing the all-one vector over TH'(G) gives 
an upper bound on Aq{n,d), which we have seen, equals the Delsarte bound. A tighter 
upper bound can be found by maximizing the all-one vector over the smaller convex set 
iV+(TH'(G')): 

maxjtrA | A E N{M)}. (6.38) 

Using the symmetries of the graph G, this can be made more explicit as follows. 

Denote by Aut(g, n) the set of permutations of q" that preserve the Hamming dis- 
tance. It is not hard to see that Aut{q,n) consists of the permutations of q" obtained 
by permuting the n coordinates followed by independently permuting the alphabet q at 
each of the n coordinates. The group Aut(g,n) acts on the set oi V x V matrices in the 
following way. For a G Aut(g, n) and A G M^^^ define (t{A) by 

{a{A))^^,^y = A^,y. (6.39) 

The matrices in M^^^ that are invariant under this action of Aut(g, n) are precisely the 
adjacency matrices Aq, Ai, . . . , A„ of the Hamming scheme H{n, q) defined by 

Jl iid{x,y) = i, 
I otherwise, 

for i = 0, 1, . . . , n and the Bose-Mesner algebra of the Hamming scheme. 

In the following calculations, it will be convenient do define for a square matrix A and 
a positive real number c the matrix R{c\ A) by 

Observe that /?(!; A) = R{A) and R{c; A) is positive semidefinite if and only if i?(c~^A) is 
positive semidefinite. Since G is invariant under the permutations a G Aut(g, n), also Ai', 
N{M') and iV+(TH'(G)) are invariant under the action of Aut(g,n). Hence if A G N{M') 
maximizes trM over all M G N{Ai'), also 



|Aut(fl,n)| 



Y, ^(^) e N{M') (6.42) 
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is a maxiinizer. Hence the inaximum in (6.38) is equal to 



maxjtrA | A G N{Ai') is in the Bose-Mesner algebra}. (6.43) 

If y4 is a matrix in the Bose-Mesner algebra, all rows of A are equal up to permuting 
by elements of Aut(g, n). Hence since Ai' is invariant under these permutations, the 
maximum is equal to 

n 

max{g" ■ xq \R{y^XiAi) is positive semidefinite 

i=0 

and there exist U e xq ■ M' and V e {I - xq) ■ M' such that (6.44) 
Uu,u = Xi if d{u, 0) = z and 
Vu,u = Xq- Xi if d{u, 0) = i]. 



Note that if U and V are as in (6.44), and a G Aut(g,n) fixes the zero word, then (j{U) 
and (y{y) satisfy the same constraints. Hence U may be replaced by 

n 7 ^ y ('iU) (6.45) 

' "^^' '^ <TeAuto(9,n) 

and similarly for V . Here Auto(g,?T,) denotes the set \o G Aut(g,n) | cr(0) = 0}. Hence 
we may impose that f/ and V are elements of the Terwilliger algebra without changing 
the maximum. We obtain 

n 

maxlg" ■ xq \R\y^XiAi) is positive semidefinite, 

R\y^ y^jM^'J), R{y^ ^ij^i'J) ^^6 positive semidefinite, 

i,j,t,p i,j,t,p (6.46) 

Xi = xovl'}, Xq - Xi = {1 - Xo)zll (i = 0, . . . , n), 
t,p t,p ^ ri 

VkI = 4i = if ^ + J - t -p G {!,. . .,d-l}}. 

Since Xi = Xq ■ yl'\ we can eliminate the variables Xj from this program by substituting 

^^ := Xo-y:;J (6.47) 

i*'P := (l-xo)4j- 

We obtain the following semidefinite program (where we have dropped all the tilde's from 
the variables): 



max{g" ■ y^'^ |_R(N^?/*'*Aj) is positive semidefinite, 

i=0 



i,i,t,p i,j 

0,0 i.i i,i I' ■ r\ \ 

Vo'o - yi,i = Zi,i (z = 0,...,n), 

t,p t,p ^ r\ 



(6.48) 



t,P _ ^t,p _ if ^ + ^- _ t - p G {1, . . . , d - 1}}. 
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If we use the stronger operator A^ in stead of A^ we arrive in a similar fashion at the 
program 

me.x{q- ■ Xo \R{xo; J2 4'Kf )' ^(^ " ^o; ^ 4f <f ) ^ 0' 

i,j,t,p i,j,t,p 

0,0 _ t,p . t,p _ (6.49) 

yl^,4^>0,y^^ = zl^ = 0iiz+j-t-pe{l,...,d-l}} 

it follows from ^q'q + Zq'q = xq and i/qq = Xq that ^g'o = 0- Since for each feasible solution 
the matrix 

^^=E^*Mf (6-50) 

is positive semidefinite, it follows from Mo,o = -Zo'o = that = Mu^ = z^^ when u 
has weight i. Hence 

0,0 I 0,0 0,0 (a c:i\ 

Xi = Vifl + hfi = Vifl ■ (6-51) 

This implies that we can eliminate the variables Xi and z^'^ by using 

0,0 t,p 0,0 t,p In roN 

^i = y^fl ' hj = Vi+j-t^pfl - Vij (6.52) 

for all i,j,t,p. We obtain the following semidefinite program: 



max{g" ■ yjo \R{yQ'^o; ^ Vi^M-'J), R{1 - yjo; ^ (2/°;Vt-P,o " yTj)^l'!j) ^ 0' 



t,p 0,0 ^ t,p 



(6.53) 



%,f = ?/°;Vp,o = Oif2+j-t-pe{l,...,c?-l}} 



ViJ ' Vi+j-t-pfl ~ Vi.j — '-'' 
t,p _ 0,0 

h,j — Vi+j-t-pfl 
This program may be further simplified by observing that for a matrix A with Ai i = 1 

R{A) ^ if and only if A ^ 0, Ai = diag(A). (6.54) 

We finally obtain 

t,p 0,0 t,P ^ n / V 

Z/jj , 2/i+j-t-p,o - Vi^j > 0, (6.55) 

0,0 _ i,i 
Vifl ~ yi,ii 

i/:;f = 2/°-S_*-p,o = oif^ + j-t-pG{i,...,rf-i}} 

This bound is very similar to the bound (4.38) derived in Chapter 111 that is to say the 



impoved version of Schrijver's bound given by Laurent. The main difference is that in 



(6.55) the symmetry conditions 



yTi = yi'i "^^^"^ t-p = t' -p' and (z, j,i + j - t - p) is a 
permutation of (i', j' , i' + f — t' — p') 

are lacking. It can be seen from the computational results in given in the next section, 
that these conditions make a huge difference in the resulting bound. 
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6.4 Computational results 

In this section we give some computational results on the different bounds we obtain and 
compare them to the bound proposed by Schrijver (see [38], [16]) with the improvement of 



Laurent. That is, the bound obtained from (4.38). Each of the bounds can be computed 



in polynomial time in n by block diagonalising the Terwilliger algebra of the Hamming 
scheme for each q and n. 

From the tables below it follows that we can have the strict inequality 



N{M) C N{M). 



(6.57) 



Table 6.1: Bounds on A^^rijd) 







best 
lower 


best upper 
bound 




bound 


bound 


bound 


n 


d 


bound 
known 


previously 
known 


Delsarte 
bound 


from 

(6.48) 


from 

(6.55) 


from 

(4.38) 


6 


3 


38 


38 


48 


48 


48 


46 


7 


3 


99 


111 


145 


145 


144 


136 


8 


3 


243 


333 


340 


340 


340 


340 


9 


3 


729 


937 


937 


937 


937 


937 


7 


4 


33 


33 


48 


48 


48 


44 


8 


4 


99 


99 


139 


139 


139 


121 


9 


4 


243 


297 


340 


340 


339 


324 


10 


4 


729 


891 


937 


937 


937 


914 


11 


4 


1458 


2561 


2811 


2811 


2805 


2583 


12 


4 


4374 


7029 


7029 


7029 


7029 


6839 


6 


5 


4 


4 


5 


5 


4 


4 


7 


5 


10 


10 


15 


15 


14 


13 


8 


5 


27 


27 


41 


41 


41 


33 


9 


5 


81 


81 


90 


90 


90 


86 


10 


5 


243 


243 


243 


243 


243 


243 


11 


5 


729 


729 


729 


729 


729 


729 


12 


5 


729 


1562 


1562 


1562 


1562 


1557 


7 


6 


3 


3 


4 


4 


4 


4 


9 


7 


6 


6 


7 


7 


7 


7 


10 


7 


14 


14 


21 


21 


21 


21 


11 


7 


36 


36 


63 


63 


62 


49 


12 


7 


54 


108 


138 


138 


138 


131 



Remark: We calculate the Delsarte bound by maximizing xo-q"' (the trace of Yli XiMi) 
under the condition that the x, are nonnegative and R{Yli ^iMi) is positive semidefinite. 
This turns out to give a more stable semidefinite program than setting xq = 1 and 
maximizing ^jXj(")(g - 1)*. 
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Table 6.2: Bounds on ^4(71, d) 



n 

8 

9 

10 



best 
lower 
bound 
d known 
4 128" 
4 320 
4 1024 
4 4096 



best upper 

bound 

previously 

known 

179" 

614 

2340 

9360 



Delsarte 
bound 

614 
2340 
9362 



bound 
from 



(6.48) 



179 

614 

2340 

9362 



bound 
from 



(6.55) 



179 

614 

2340 

9360 



bound 
from 



(4.38) 



169 

611 

2314 

8951 



9 
10 



32 

70 

256 

1024 



32 

128 

512 

2048 



40 

160 

614 

2145 



40 

160 

614 

2145 



40 

160 

614 

2145 



39 

147 

579 

2045 



10 

11 
12 



256 
1024 
4096 



512 
2048 
6241 



512 
2048 
6241 



512 
2048 
6241 



511 
2047 
6241 



496 
1780 
5864 



10 
12 



40 
256 



80 
1280 



112 
1280 



112 
1280 



111 
1280 



106 
1167 



Table 6.3: Bounds on A^{n,d) 







best 


best upper 














lower 


bound 




bound 


bound 


bound 






bound 


previously 


Delsarte 


from 


from 


from 


n 


d 


known 


known 


bound 


( 


6.48 


) 


( 


6.55 


) 


( 


4.38 


) 


6 


4 


125 


125 


125 


125 


125 


125 


7 


4 


250 


554 


625 


625 


623 


545 


8 


4 


1125 


2291 


2291 


2291 


2291 


2291 


9 


4 


3750 


9672 


9672 


9672 


9672 


9672 


10 


4 


15625 


44642 


44642 


44642 


44642 


44642 


11 


4 


78125 


217013 


217013 


217013 


217013 


217013 


7 


5 


53 


125 


125 


125 


124 


108 


8 


5 


160 


554 


625 


625 


623 


485 


9 


5 


625 


2291 


2291 


2291 


2291 


2152 


10 


5 


3125 


9672 


9672 


9672 


9672 


9559 


11 


5 


15625 


44642 


44642 


44642 


44642 


44379 


8 


6 


45 


75 


75 


75 


75 


75 


9 


6 


135 


375 


375 


375 


375 


375 


10 


6 


625 


1875 


1875 


1875 


1875 


1855 


11 


6 


3125 


9375 


9375 


9375 


9375 


8840 


11 


9 


25 


35 


45 


45 


45 


43 



Chapter 7 
Further discussion 



In this chapter, we present some further observations, and notes related to the methods 
from previous chapters. 



7.1 Bounds for affine caps 

Let AG(/c, q) be the fc-dimensional affine space over the field GF^. A subset A C AG(A;, q) 
is called an affine cap if no three elements of A are on an affine line, that is, any three 
different vectors in {(^) | a G A} are linearly independent. We denote by Ck{q) the 
maximum cardinality of an affine cap in AG(fc, q). 

The effectiveness of the semidefinite programming approach for error correcting codes, 
suggested that we could, more generally, find good bounds for the size of a code where we 
forbid the occurence of triples of code words in some prescribed configuration. Indeed the 
variables x/^ in the semidefinite program correspond exactly to the number of tripes in a 
code, for each equivalence class under automorphisms of the Hamming space. One may 
be led to wonder if setting those variables that correspond to forbidden configurations to 
zero, would yield good upper bounds in general. This is an appealing idea. Unfortunately, 
it turned out to be false in general. 

A prominent structure that might be approached this way are affine caps over the field 
of three elements. The only known values are Ci(3) = 2, C2(3) = 4, 6*3(8) = 9, Ciii) = 20 
and C5(3) = 45. In 0] the general bound ^^(3) < S'^^ was shown. A code A C AG(A;, 3) 
is an affine cap if and only if for any three elements u, v,w G A we have (i(u, v,w) ^ 
(z, i, i, 0) for every i = 1, . . . ,k. Recall that 

o?(u,v,w) := {i,j,t,p), where (7.1) 

i := (i(u, v), 
j :=rf(u,w), 

t := \{i \ Ui ^ \i and u^ ^ Wj}|, 
p := \{i I Ui 7^ Vi = Wi}|. 

Consider the following semidefinite program. 
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^PJ2*x°;o subject to (7.2) 



maximize 

1=0 

(i) ^o;o = 1 
(ii) < x*;^ < xlo 

(iii) Xjj = x-,j, at — p = t' — p and 

{i, j,i + j — t — p) is a. permutation of (?', j', i' + j' — t' — p') 
(iv) x*'j = for z = 1, . . . , n. 

(v) Y^ ^Tj^ij^ Yl (^°+i-t-P,o - ^W^ij ^^^ positive semidefinite. 

id,t,P i,j,t,P 

Clearly, this gives an upper bound on C„(3). We have the following result. 
Proposition 45. The maximum in (7.2) equals 1 + ^^-^• 



Proof. Setting the variables x/^ as follows: 



^,3 



t,p 



'I i{i=j = t=p = 0, 

I ifz=j = t=P7^0or exactly one of i, j is zero, 

iii=j = ty^O and p = 0, 

^ I otherwise 



(7.3) 



3"-l 



1 


M„,„ 


M_,,_, 


Mu,u 


M„,u 





^-u,-u 





M_,,_,. 



gives a feasible solution with objective value equal to 1 , ^ • 

On the other hand, let any feasible solution be given. Then the matrix M' : = 
'^ijtp^ij^i'J is positive semidefinite. Hence for any nonzero word u the 3x3 prin- 
cipal submatrix indexed by the words 0, u, — u is positive semidefinite and equals 



(7.4) 



This implies that (Mu,u - MlJ{M.u,-u - Ml^_J > (Mu,uM_u,-u)^- Hence M^.u + 
-^-u-u < 1- Since the objective function equals the trace of M, the value is at most 

This bound is very poor. The same bound already follows from the fact that if G A, 
for every nonzero word u not both u and — u can belong to A. 

7.2 Notes on computational results 

The computational results from Chapters |4] and [5] have been obtained by using CSDP 
version 4.7 (see [7]) and SDPT3 (see jl2])- Both are SDP-solvers and can be accessed 
also through the NEOS server, see 
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www-neos . mcs . anl . gov/ 

. The semidefinite programs were generated by perl scripts in the sparse SDPA format, 
which allows for explicit block structure in the constraint matrices to be exploited by the 
solver. 

In the case of error correcting codes (tables 1,2,3 from Chapter [i]), all solutions pro- 
duced by the solvers have been examined by a perl script to ensure that the produced 
numbers really do give valid upper bounds on error correcting codes. In none of the in- 
stances this has made a diference for the final bound obtained. This was done as follows. 

The original problem was to maximize '^- (")(<? — 1)*Xj'q, given certain constraints 



«j 



on the variables x^'^ (4.27). By changing the sign of the objective vector, we obtain a 



semidefinite program of the following form: 

minimize XiCi + ■ ■ ■ + XmCm (7.5) 

subject to xiFi + ■ ■ ■ + XmFm — -fo =: -^ ^ 0, 

where we minimize over x'^ = {xi, . . . ,Xm), c""" = (ci, . . . , c^) is the objective vector and 
Fq, . . . , Fm are given symmetric matrices. The SDP solver not only returns a solution to 
this (primal) problem, but also to its dual: 

maximize (Fq, Y) (7-6) 

subject to (Fj, y) = Cj, i = 1, . . . ,m, 

Y yo. 

Any genuine feasible matrix Y for the dual problem gives a lower bound on the minimum 
in the primal problem, and hence an upper bound for our coding problem. However, the 
produced dual solutions Y usually do not exactly satisfy the linear constraint, but satisfy 

(F„F) = Q + e,, (7.7) 

for small numbers ei, . . . ,6^- In all cases we did find that Y y was satisfied. This 
yields a lower bound on the optimum of the primal program as follows. Let (x, X) be an 
optimal solution for the primal program with value O. Then we obtain: 

(Fo,r) = {x,F, + --- + XmFm-X,Y) (7.8) 

< {xiFi + ■ ■ ■ + XmFm,Y) 
= Xi(Fi,y) + --- + x^(F„,F) 

= Xi(Ci + ei)H \-Xm{Cm + (^m) 

= + (xieiH \-Xmem)- 



The numbers ei, . . . , e^ are easily calculated from the solution Y. Although the numbers 
Xi, . . . ,Xm are not known, we can say that Xi G [0, 1] in this case, which allows us to 
bound the error term by 

(xiei H h Xmem) < max{0, ej H h max{0, em}- (7.9) 

This gives lower bound on O, and hence an upper bound on the maximum size of a code. 
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Samenvatting 



Dit proefschrift gaat over foutcorrigerende codes en overdekkingscodes. Een code is een 
collectie woorden van dezelfde lengte n met letters uit een alfabet q = {0, 1, . . . , g — 1} 
bestaande uit een g-tal symbolen. In het geval dat q = 2 bestaat elk woord uit een rijtje 
van n nullen en enen. We spreken in dat geval van een binaire code. Voor g > 3 spreken 
we van niet-binaire codes. De Hamming afstand (i(x, y) tussen twee woorden x en y is 
gedefinieerd als het aantal posities waarin zij verschillen. Zo krijgt de verzameling q" van 
alle woorden de structuur van een metrische ruimte, de Hamming ruimte. 
Een centrale vraag in de tlieorie van foutcorrigerende codes is: 

Gegeven een 'minimum afstand' d, wat is het maximale aantal woorden in 
een code als we eisen dat van elk tweetal woorden de onderlinge afstand ten 
minste d moet zijn? 

Dit maximum, aangegeven met Aq{n,d) heeft een mooie 'meetkundige' interpretatie in 
het geval dat d = 2e + 1 oneven is. Het getal Aq{n, d) is dan precies het aantal bollen 
van straal e dat binnen de Hammingruimte kan worden gestapeld. De foutcorrigerende 
eigenschappen van zo'n code volgen uit het feit dat wanneer een codewoord in hoog- 
stens e posities wordt gewijzigd, het originele woord weer terug wordt gevonden door het 
dichtsbijzijnde codewoord te nemen. 

Een twee vraag, die duaal is aan de vorige, speelt een rol in onder andere datacom- 
pressie: 

Gegeven een 'overdekkings straal' r, wat is het minimale aantal woorden in 
een code als we eisen dat ieder woord afstand ten hoogste r tot een woord in 
de code heeft? 

Dit minimum, aangegeven met Kq{n,r) is het aantal bollen van straal r dat nodig is om 
de hele Hamming ruimte te bedekken. 

In het algemeen zijn de getallen Aq{n, d) en Kq{n, r) erg moeilijk te bepalen en slechts 
weinig waarden zijn bekend. Daarom is het interessant om goede onder- en bovengrenzen 
te vinden voor deze getallen. Het meetkundige beeld van het stapelen van en overdekken 
met bollen geeft al een bovengrens voor Aq{n,2r + 1) en een ondergrens voor Kq{n,r) 
door het volume van de gehele Hamming ruimte te delen door het volume van een bol 
van straal r. 

In dit proefschrift geven we nieuwe bovengrenzen voor Aq{n,d) en nieuwe ondergren- 
zen voor Kq{n, r) met behulp van semidefiniete programmering. Een belangrijke rol wordt 
gespeeld door een expliciete blokdiagonalisatie van de Terwilliger algebra van het Ham- 
ming schema. Deze maakt het mogelijk om de grote symmetriegroep van de Hamming 
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ruimte te benutten, zowel voor het verkrijgen van scherpere grenzen, als voor het ef- 
ficient kunnen bepalen van deze grenzen. De beschreven methode voor het begrenzen 
van Aq{n,d) werd door Schrijver geintroduceerd voor het binaire geval in [5S]. In het- 
zelfde artikel werd ook een blokdiagonahsatie gegeven voor de Terwilliger algebra van 
het binaire Hamming schema. Een centraal resuhaat uit dit proefschrift is een expliciete 
blokdiagonahsatie van de Terwilliger algebra van het niet-binaire Hamming schema. 

In hoofdstuk 2 brengen we de benodigde theorie in herinnering. In het bijzonder 
stippen we de krachtige methode van Delsarte [15] aan, waarmee met behulp van as- 
sociatieschemas bovengrenzen voor Ag{n, d) te verkrijgen zijn door middel van lineaire 
programmering. Het idee is om te kijken naar de afstandsverdeling 1 = xq, Xi, . . . , x„ van 
een code, waar Xi het gemiddeld aantal codewoorden op afstand i van een codewoord is. 
De getallen Xi voldoen aan bepaalde lineaire ongelijkheden. De eerste soort ongelijkheden 
heeft een directe combinatorische betekenis: de getallen Xi zijn niet-negatief en Xj = als 
er geen twee woorden zijn op afstand i. De andere ongelijkheden, met coefiicienten gegeven 
door de Krawtchouk polynomen, hebben een diepere betekenis. Zij weerspiegelen het feit 
dat de corresponderende lineaire combinatie A := xoAo + - ■ ■+XnAn van associatiematrices 
van het Hamming schema positief semidefiniet is. Dat het positief semidefiniet zijn van A 
kan worden teruggebracht tot een n + 1 tal lineaire ongelijkheden, is het plezierige gevolg 
van het feit dat de Bose-Mesner algebra behorende bij het Hamming schema commutatief 
is, en daardoor in diagonaalvorm kan worden gebracht. 

Een van de ideeen achter het onderhavige werk, is om naar de verdeling van drietallen 
codewoorden te kijken in plaats van naar paren. Dit leidt tot de bestudering van een 
verfijning van de Bose-Mesner algebra in Hoofdstuk 3. De algebra bestaat uit alle ma- 
trices die invariant zijn onder die automorfismen van het Hamming schema H{n,q), die 
een gekozen woord vasthouden. Er is een basis van 0-1 matrices die geparametriseerd 
wordt door de mogelijke configuraties van drietallen woorden modulo automorfismen. We 
laten zien dat de algebra overeenkomt met de Terwilliger algebra [39] van het Hamming 
schema. Deze Terwilliger algebra is niet langer commutatief en kan daarom niet worden 
gediagonaliseerd. Het analogon voor niet-commutatieve algebras is een blokdiagonahsatie, 
waarbij de algebra bestaat uit alle matrices met gegeven blok-diagonaal structuur. Een 
centraal resultaat van dit proefschrift is een expliciete blokdiagonahsatie van de Terwilliger 
algebra behorende bij het niet-binaire Hamming schema. Hoewel het positief semidefiniet 
zijn van een matrix in de Terwilliger algebra niet langer kan worden geformuleerd door 
een klein aantal lineaire ongelijkheden, geeft de blokdiagonahsatie toch een handzame for- 
mulering in termen van het positief semidefinitiet zijn van een klein aantal kleine matrices 
(het aantal is O(n^) en de grootte 0{n)). 

In hoofdstuk 4 geven we een verscherping van de Delsarte grens voor codes. Met behulp 
van de expliciete blokdiagonahsatie van de Terwilliger algebra uit hoofdstuk 3, kan deze 
grens efficient worden bepaald middels semidefinite programmering. Voor g = 3, 4, 5 levert 
dit computationeel een reeks verscherpingen op voor bekende bovengrenzen voor Ag{n, d). 

In hoofdstuk 5 beschouwen we overdekkingscodes en geven we nieuwe ondergrenzen 
voor Kq{n,r). Veel bestaande grenzen voor Kq{n,r) zijn gebaseerd op de afstandsverde- 
ling Aq{x.), . . . , A„(x) van de code C gezien vanuit een woord x. Hier is Aj(x) het aantal 
woorden in C op afstand i van x. ledere lineaire ongelijkheid in Aq, . . . ,An die voor de 
afstandsverdeling vanuit ieder woord x geldt, geeft een ondergrens voor Kg{n, r). De voor 



de hand liggende ongelijkheid 

Ao + Ai + --- + A, > 1 (7.10) 

leidt op deze inanier tot dezelfde grens ('sphere covering bound') als het volume argument 
als boven. Vanuit polyhedraal oogpunt optimahseren we een hneare functie over een 
een polytoop PC [0, 1]''" binnen de eenheidskubus, met een groot aantal symmetrieen, 
namehjk de symmetrieen van de Hamming ruimte q". Met behulp van de theorie van 
matrix snedes |32] kunnen we P vervangen door een kleinere convexe verzamehng, en 
daarmee scherpere grenzen voor Kq{n,r) vinden. Om deze grenzen efficient te kunnen 
bepalen met hneaire en semidefiniete programmering, is wederom de blokdiagonahsatie 
van de Terwilhger algebra van het Hamming schema van groot belang. Computationeel 
levert deze methode voor g = 3 en g = 4 een aantal verscherpingen op ten opzichte van 
de ondergrenzen voor Kq{n,r) uit de literatuur. 

In hoofdstuk 6 brengen we deze theorie van matrix-snedes in herinnering en bestuderen 
we de relatie tussen de nieuwe grenzen voor Ag{n,d) en deze theorie van matrix snedes. 
In het bijzonder blijkt dat de grenzen voor Ag{n, d) uit hoofdstuk 4 scherper zijn dan die 
afkomstig van het toepassen van matrix-snedes op het 'theta-body'. Dit is (vooral) te 
danken aan extra relaties die voortvloeien uit de aanwezige symmetrieen. 



SAMENVATTING 



Dankwoord 



Met groot genoegen inaak ik hier van de mogelijkheid gebruik om een ieder te bedanken 
die mij gedurende mijn promotietijd heeft gesteund, met dit proefschrift als resultaat. 

Zonder mijn promoter, Lex Schrijver, was dit proefschrift er niet geweest, en zou ik 
geen promotieonderzoek hebben gedaan in de combinatorische optimalisatie. Graag wil ik 
hem bedanken voor zijn continue steun en vertrouwen, ook wanneer ik dat laatste schier 
verloren had. Ik ben bhj dat ik zo veel van hem heb kunnen leren. 

Dit boekje heb ik opgedragen aan Violeta, mijn partner en moeder van onze twee 
kinderen Amber en Mark. Hoewel ik soms te weinig tijd voor hen vrij maakte, heeft 
Violeta mij ahijd gesteund. Mark en Amber hebben mij ahijd weer weten te verleiden 
om mijn werk even opzij te schuiven. 

Veel heb ik ook te danken aan mijn ouders, en ook mijn schoonouders. Zij hebben mij 
op zoveel verschillende manieren beinvloed en ondersteund. 

Voor de prettige werksfeer dank ik mijn collegas aan de UvA, in het bijzonder mijn 
kamergenoot Pia Pfiuger. Ook aan het CWI heb ik het enorm getroffen met fijne collegas. 
Ik bedank Monique Laurent voor de gesprekken over het onderwerp van dit proefschrift. 
Ook wil ik mijn kamergenoot Gabor Maroti bedanken. Zonder hem was het vast niet 
gelukt om Theory of Integer and Linear Programming zo grondig te bestuderen. Ook 
ging er geen keer voorbij dat hij geen nice puzzle had. Ik vermoed dat zijn Hongaarse 
achtergrond hier deels verantwoordelijk voor was. 

Graag wil ik ook Chris Zaal noemen. Ik herinner mij goed de SET-workshop die we 
aan het APS gegeven hebben, de opnamen van de Nationale Wetenschaps Quiz en diverse 
andere creativiteiten op het gebied van wiskunde-promotie. In het bijzonder heeft hij mij 
geintroduceerd bij Pythagoras, waar ik nog steeds de problemenrubriek mag vullen. 

Ik dank ook Marco Zwaan. Via hem heb ik mogen proeven hoe het is om wiskundel- 
eraren te onderwijzen aan de eerstegraads opleiding, een bijzondere ervaring. 

Tenslotte dank ik de iedereen die ik niet met name heb genoemd, maar aan wie ik 
niettemin fijne herinneringen dank. Bedankt! 



89 



90 



Curriculum Vitae 



Dion Camilo Gijswijt is geboren op 20 maart 1978 te Bunschoten. Al vroeg had hij 
grote interesse in natuurkunde en andere exacte wetenschappen. Tijdens de eerste jaren 
van zijn middelbareschool-tijd groeide zijn interesse voor kunstmatige intelligentie en 
wiskunde. Het boek Godel, Escher, Bach van D.R. Hofstadter had daarin een groot 
aandeeL Uiteindehjk gaf zijn deelname aan de International Wiskunde Olympiade in de 
laatste jaren van de middelbare school de doorslag om wiskunde te gaan studeren. 

In 1996 slaagde hij voor het eindexamen VWO aan het Goois Lyceum in Bussum en 
begon aan een dubbele studie wiskunde en natuurkunde aan de Universiteit van Amster- 
dam. Na in 1997 zijn propedeuses wiskunde en natuurkunde cum laude te hebben behaald, 
besloot hij zich te concentreren op de studie wiskunde. Tijdens zijn studie besteede hij 
een groot deel van zijn tijd aan het tijdschrift Pythagoras, dat juist onder de bezielende 
leiding van Chris Zaal nieuw leven was ingeblazen. Nadat hij in 1996 begon met het 
vullen van de problemenrubriek, werd hij in 1998 tevens redacteur. 

In augustus 2001 behaalde hij zijn doctoraaldiploma (cum laude) na het schrijven van 
een scriptie getiteld The Colin de Verdiere Graph Parameter fi onder begeleiding van Lex 
Schrijver. In September 2001 begon hij, eveneens onder supervisie van Lex Schrijver, aan 
zijn promotie-onderzoek aan het KdVI aan de Universiteit van Amsterdam. Dit onder- 
zoek, met als thema Spectral Methods for Graph Optimization and Embedding, mondde 
in juli 2005 uit in het onderhavige proefschrift. 

Gedurende deze vier jaar als promovendus was hij ook in de gelegenheid andere 
activiteiten te ontplooien. Naast het met plezier begeleiden van diverse werkcolleges, 
heeft hij vooral veel geleerd van het doceren van een college grafentheorie aan eerstejaars 
wiskunde studenten, en later aan leraren aan de eerstegraads lerarenopleiding. Ook het 
begeleiden van twee studenten bij het schrijven van hun kandidaatsscriptie was een grote 
ervaring. 

Daarnaast heeft hij zich beziggehouden met activiteiten ter promotie van de wiskunde. 
Naast zijn werk voor Pythagoras, was hij actief als lid van de vraagstukkencommissie 
voor de Nederlandse Wiskunde Olympiade, en was betrokken bij diverse promotionele 
activiteiten waaronder: de Leve de Wiskunde dag, Nationale Wetenschapsdag, voorlicht- 
ingsdagen van de UvA, mastercourse voor wiskundeleraren en workshops voor scholieren 
en leraren. 



91 



